Tan0004102 (gene) Snake gourd v1

Overview
Name: Tan0004102
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: U1 small nuclear ribonucleoprotein C
Location: LG06: 7597251 .. 7601323 (-)
RNA-Seq Expression: Tan0004102
Synteny: Tan0004102
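
The Location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as is usual for GFF-style annotations, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a string such as 'LG06: 7597251 .. 7601323 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("LG06: 7597251 .. 7601323 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4073 bp on the minus strand of LG06.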
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCTTCTTTGCAAAAAAAGACTAGAAGTCTCTCTCTTCTTCGTTCTCGCAGAGTCGCTGATAGGGTTTTTGGTGGGTCTGCCATTGTTGCTTCTTCTTTGAGCTACCGCTATGCCTCGGTTCGTTCTTCCATCCCCATTTTCTTCTATGCCTCATCGTCTCATCCCTCAATCAAATTTACTTCTCTCTCTATTTCTATCTACAGTTAGTGATTTGATATTGTTGTATATATGCAGGTATTATTGTGACTATTGTGACACATATCTGACCCATGATTCTGTAAGTCATGTTCTTACATTCAGATGGCAATGCCATTTCCTTTTCGTTTGATCATCGCCCACTTGCTTTTTAGGACTTTATTTTTTATTTTTTATTTTTCGGGGTTTCTTTGTTGATGCTGCAGTTCCCTGTCTTTGTGATGGGGAATGTTTTTAGGTTAATGATTTTTGTATTTTTAATGCACCTATCAGTTGCTCGACAAAATTGTAGGAAATTTTGGGTTGATACGACTGTGGTTGGTGTTGGGATTGGATCGAATTCTTATATTTATTCTCAATGGGTGAAATCGACTGTTTGTAGTTTGTATTTTTTATGAATTCTAGATTTCAGTGTTCAGGATGTTGAGCTTATGCAGATTGGTTTGTGTTTATGTTTGTGTGTTTCTTTTCCCCCCGCGAGAAGCAAGAACGTTTTGCTGTTAATTGATAAGTTACAATATTAGAGCTCCTTCTTTTAGACTAGAAACTATTGAACCCGAAGCCGAAACTAAAAATTACATGTCATGTGAACAAATGATTAGAAGGCTTTTTGCCAAAGTGAGCTTAGTTAAGTGGTTATTGGCATGCACCCTCGACCAAAGAGGTCACAGGTTCGAATCTCCCCCACCCCCCAATGTTGTTGAACTAAATAAATTATTAGAAGGCTTCCCAACTCTTAAAAAACAAGTAATTGTGAAAATGTTTTAGGAAGAGCACTCCAAGAAGCCGTTCAATTCTTTATCAATGTGCTTTCTTCCTCCCTCCTATTTGAAGCCTATGATTTCTTTCCGACTAAATATCCTAAATCACAGAACCACCATAGGTTGGCCTAGTGATAATTGAAGCCAATGAAGAAACAAGTGAGTTTAGGGGAAATTAGTTTAAACCAAGGTGGCCACTTTTCTAGGATTTTGAAATCCGACAAGTTTTCTTAGCACAACCAAATGTCATAGGGTCAAGGAGTTAATTGCCTCTTGACATTAGGCAAGGTGTGCCAAAGTGAGCTTCGACACTAAAAAAAGCCTAAACCATGGCTTTAATAAATGGAAACCCATAAACTCTTGCTTTCAAATCAAACATGCCAAGCCTCTGGTTTCTTTTCTTTCAAACTAAGATTGGTTTGTACAATCTTTAATGGTTTATTTCAATTGTACTTGATTCTATGTTCCAGAAACAATAGTATTTAGTGAGTTATCAATGTCTAATATTGCTCATATTTTGTTTTATCGGTATCTCAGCCATCTGTGAGGAAGCAGCATAATGCAGGCTACAAACATAAGGTATCCTAATCTTCCCCACGTCTCAATTTAGCATGGTATCCCTTATTCCGAGGTTGAGTTTCAAAATTAAATTTAATAGAAATTTTATGTTTTTTAAAAACAAAACCACTTTTAGACTTAATAGAACTTGGTTGGTTACTTGGTTATAGTTTCAGAAACACTTATGAAGGTGAAAAGTTAGAGGTAAGATTTTATTGTTTTTCAGAGGTTGCTTTCTGTTTACTGAAATATGATAAAGATTAAGATTAGTGGCTTTAAGTGTATTGTAATTGTAACTTTCATATCTTGAAATGAGAAGATAAAATTTAAAGAAATGTGGAAGCACTAAAAATATCTTTCCATCCATTCATCCATCCAAAAGAACTTGTTTTTAAAAGATTTGAGAGTACTTTTTTTTAAAAGATTTGAGAGTACTTTTAAGAAAAAATGGACAACAAATCTTAGTTCCTGATTTATGGATGCACT
CCAGGCAAACGTGCGATCATACTATCAGCAATTTGAGGAGCAACAAACCCAAAATTTAATTGACCAGAGGATCAAAGAACATCTGGGTCAAGCAGCAGCATTCCAGCAGGTTGGTGCAGCCTACAATCAGCATTTACTCGGCCAAAGACCTCGTCTTCCTGTACTACCTACTCCTGTAATGCCGGGAGGTGCCCCGGGATTACTGCCCGGAATTAGGCCTCCAGTTTTGCCAAGACCAGTTCCTGGTGCTCCAGGTAAAAATCACTGATTTTATACCTTCGAGATTTAGTAATTTTTTATGCACATCCCTTTAACTTTGATTGTGTCATTTCACAATTTGCATTTCTAACTAAAACAACCTTGTTATGCTTTTTGATAAGTAAAATCTGATAGTCAAAGGAATATGCAGCTTGAATATTGGAGATAGTTCTCTAGACATAAATAAATGCCCAAGTAGAATTTGACACATCTCTGTCTGGTTAAAATTATTTCTGTTGGTATAGAAATGATGTCTATTCCTTTCTTTTCAACAAACAATAATTGAAATTTCATTAGAATATGACCTGGAACTGAAAGATTATGAGATGTGAGCAATTTAGATTTTCTAGCTACTACTATCTCTTGAGGCTCTTGATCCAACTAATCCTCAAAATCTCCATGCCATTATTAGGGGGTGTAAGTGGTTGGTTCTTGGCAATAACCCTCACCAATCCGACATATCAGTTTTAGAAAAATTAAAAATTGATGCCAAAATCAATTTTTTTAAAAAACAACAGCAAGTCGACTGACTATTTTTCGATTTTTTGAATTTACCAAACCAAAACAACTAACAAGTATACTCGAACCAAACAAATCGACTTATGTCAGTTCTTAGTTCTTATTGATTTTTGGCTTGCAGTCCTTGTCATTACTGAATGGAATTAGAGGTAACCAATGCAGTAGGTAGGATAAATTATACAAAATACTCATGAACTTTGCCATTATGCCCCTAGACTTTGAATTGTTTCATTTTTATCTTGGACTTTGAAATTTGTTCCAAAAATACCTTTGAACTTTGACCTTGTTCCAAAAAATATATTTTTCATTAAAATAAATGATGGAACTTGACATAATAGCTAATGTGTAAAAATTGGAAAGCGTACGCGACTGTTATTTTACTACTATATAGAAGATTGAGGTCGTTGTTATCATTTATTTCATCTCAAAAAATATCATTAAATCATTTATATAATCAATTTGTATACATTAGTTGTCACGTCAATATATGTGAATGATGTTGTAAATGATATCACGTATGCATTTTAATTTTACACATTAGCTGCCACATCAATTTTAATGAAAGAACTCTCCAAAGGTATTTTTGAAACACATTTCAAAGTTCATCAAAGGTAGAATTGAAACTTTTCGAAGTCTAGAGACGTAATTAAAGCCTACGTTTCAGAGTTACTTTGCCTAAGCCAGCTTATTTAGCTATACGTATCAGATTGATTGAACTCCTCCTTTGTTAATGTAGGATATCTACCTGCTCCTACAATGCCACCCATGATGGCCCCGCCGGGAGCTCCTATGCCCGGCCAAGTGAACCTTCCTGCAAGGCTGCCACCTCCAGCGCCAATTCCAGGGAGCGCGCCGCAGCCATCATCGACCAATGGTGCACCGTTGGCTGCGCCACCAATGTATCAAGCAAATCCAGCAGCACCAGGAAGTGGAGGGTATGATAGTTTCACCACCATGGCTCAACCTTCCGAGTCTAACCATTAGAGCTTCTAATTCTTGTGCTGTCTGTGTATGATATCCAAAGCTCTTCTAATGCAAACCGAGACACAACAATTCTTAAAATATCTGAAGAAAAATTTGGAGCTTGGAGATTTAATGTAGAAAAGTTTTTTTTAGTGAGTGAGCAAAGGGAATGATTTTTGGTGGTTGATTCCTTGCTACTTGTTTGGAGATAGAATATATAATGTGATTATATGGTAAAATGGGCAACTCTGAATTCGTTC
AATGAGGCTTTTACTTTTACTTTTCTATTTGTAATTTCTAGTGCTCGTGAATGGAATGTATTTGATCATGTCA

mRNA sequence

CCTTCTTTGCAAAAAAAGACTAGAAGTCTCTCTCTTCTTCGTTCTCGCAGAGTCGCTGATAGGGTTTTTGGTGGGTCTGCCATTGTTGCTTCTTCTTTGAGCTACCGCTATGCCTCGGTATTATTGTGACTATTGTGACACATATCTGACCCATGATTCTCCATCTGTGAGGAAGCAGCATAATGCAGGCTACAAACATAAGGCAAACGTGCGATCATACTATCAGCAATTTGAGGAGCAACAAACCCAAAATTTAATTGACCAGAGGATCAAAGAACATCTGGGTCAAGCAGCAGCATTCCAGCAGGTTGGTGCAGCCTACAATCAGCATTTACTCGGCCAAAGACCTCGTCTTCCTGTACTACCTACTCCTGTAATGCCGGGAGGTGCCCCGGGATTACTGCCCGGAATTAGGCCTCCAGTTTTGCCAAGACCAGTTCCTGGTGCTCCAGGATATCTACCTGCTCCTACAATGCCACCCATGATGGCCCCGCCGGGAGCTCCTATGCCCGGCCAAGTGAACCTTCCTGCAAGGCTGCCACCTCCAGCGCCAATTCCAGGGAGCGCGCCGCAGCCATCATCGACCAATGGTGCACCGTTGGCTGCGCCACCAATGTATCAAGCAAATCCAGCAGCACCAGGAAGTGGAGGGTATGATAGTTTCACCACCATGGCTCAACCTTCCGAGTCTAACCATTAGAGCTTCTAATTCTTGTGCTGTCTGTGTATGATATCCAAAGCTCTTCTAATGCAAACCGAGACACAACAATTCTTAAAATATCTGAAGAAAAATTTGGAGCTTGGAGATTTAATGTAGAAAAGTTTTTTTTAGTGAGTGAGCAAAGGGAATGATTTTTGGTGGTTGATTCCTTGCTACTTGTTTGGAGATAGAATATATAATGTGATTATATGGTAAAATGGGCAACTCTGAATTCGTTCAATGAGGCTTTTACTTTTACTTTTCTATTTGTAATTTCTAGTGCTCGTGAATGGAATGTATTTGATCATGTCA

Coding sequence (CDS)

ATGCCTCGGTATTATTGTGACTATTGTGACACATATCTGACCCATGATTCTCCATCTGTGAGGAAGCAGCATAATGCAGGCTACAAACATAAGGCAAACGTGCGATCATACTATCAGCAATTTGAGGAGCAACAAACCCAAAATTTAATTGACCAGAGGATCAAAGAACATCTGGGTCAAGCAGCAGCATTCCAGCAGGTTGGTGCAGCCTACAATCAGCATTTACTCGGCCAAAGACCTCGTCTTCCTGTACTACCTACTCCTGTAATGCCGGGAGGTGCCCCGGGATTACTGCCCGGAATTAGGCCTCCAGTTTTGCCAAGACCAGTTCCTGGTGCTCCAGGATATCTACCTGCTCCTACAATGCCACCCATGATGGCCCCGCCGGGAGCTCCTATGCCCGGCCAAGTGAACCTTCCTGCAAGGCTGCCACCTCCAGCGCCAATTCCAGGGAGCGCGCCGCAGCCATCATCGACCAATGGTGCACCGTTGGCTGCGCCACCAATGTATCAAGCAAATCCAGCAGCACCAGGAAGTGGAGGGTATGATAGTTTCACCACCATGGCTCAACCTTCCGAGTCTAACCATTAG

Protein sequence

MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQAAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAPTMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSGGYDSFTTMAQPSESNH
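
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGCCTCGGTATTATTGTGACTATTGTGAC"  # first 30 nt of the CDS above
cds_end = "GAGTCTAACCATTAG"                   # last 15 nt of the CDS above

print(translate(cds_start))  # MPRYYCDYCD
print(translate(cds_end))    # ESNH*
```

The two fragments translate to the first ten and last four residues of the listed protein, followed by the stop codon.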
Homology
BLAST of Tan0004102 vs. ExPASy Swiss-Prot
Match: F6HQ26 (U1 small nuclear ribonucleoprotein C OS=Vitis vinifera OX=29760 GN=VIT_07s0104g01170 PE=3 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 6.3e-68
Identity = 151/215 (70.23%), Positives = 162/215 (75.35%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMP--GGAP-----GLLPGIRPPVLPR 120
            AAFQQVGAAYNQHL+       RPRLPVLPTP MP  G AP      L+PG+RPPVLPR
Sbjct: 61  TAAFQQVGAAYNQHLVSFPGNPPRPRLPVLPTPGMPVAGSAPLPMNSPLVPGMRPPVLPR 120

Query: 121 PVPGAPGYLPAPTMPPMMAPPGA---PMPGQVNLPARLPP----PAPIPGSAPQPSSTNG 180
           PVPGAPGY+PAP MP MMAPPGA   PMP   +LP   PP    P  +PGS   P+S   
Sbjct: 121 PVPGAPGYMPAPGMPSMMAPPGAPSMPMPPLNSLPR--PPTMNVPPAVPGSTSTPTSGGA 180

Query: 181 APLAAPPMYQANPAAPGSGGYDSFTTMAQPSESNH 197
             +   PMYQANPA P SGG+DSF   AQ  E+NH
Sbjct: 181 PSMMTQPMYQANPAGPTSGGFDSFNINAQGPEANH 213
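
The percentages on each HSP summary line are the identical (or positive-scoring) columns divided by the alignment length. A minimal sketch of recomputing them from the text of the line follows; `parse_hsp_stats` is a hypothetical helper, and the regular expression is deliberately loose about the spelling of the second label.

```python
import re

def parse_hsp_stats(line):
    """Extract counts from an 'Identity = a/n (...), Positives = p/n (...)' line."""
    match = re.search(r"Identity = (\d+)/(\d+).*?Pos\w* = (\d+)/(\d+)", line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "alignment_length": ident_len,
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }

line = "Identity = 151/215 (70.23%), Positives = 162/215 (75.35%), Query Frame = 0"
print(parse_hsp_stats(line))
```

For the Vitis vinifera hit this reproduces 70.23% identity and 75.35% positives over 215 aligned columns. The alignment length counts gap columns, which is why it can exceed the length of the query protein.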

BLAST of Tan0004102 vs. ExPASy Swiss-Prot
Match: C5XYW4 (U1 small nuclear ribonucleoprotein C-2 OS=Sorghum bicolor OX=4558 GN=Sb04g028260 PE=3 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 2.0e-50
Identity = 138/233 (59.23%), Positives = 151/233 (64.81%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR+YYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGGAP--GLLPGIRPPVLPRP-VPG 120
           AAAF Q GA +NQH+L       RPRLP+LPTP MP G P   L+PG+RPP+LP P VPG
Sbjct: 61  AAAF-QAGAPFNQHMLAFPGAVARPRLPILPTPGMPHGFPQAPLMPGVRPPILPAPGVPG 120

Query: 121 APGYLPAPTMPPMMAPPGA-PMPGQVNLPARLPPPAPIPGSAPQ-----------PSSTN 180
            PG    PTMP   APPG+ P PG    P  +P P   PGS P            P  T+
Sbjct: 121 YPG--APPTMPQPGAPPGSMPQPGAP--PGSMPQPGAPPGSMPMQMAPLPRPPTLPPPTS 180

Query: 181 GAP------LAAPP-MYQANPAAPG---SGGYDSFTTMAQ-------PSESNH 197
           G P       AAPP +YQANP AP    SG   +  T  Q       PSE NH
Sbjct: 181 GVPGAPIPNSAAPPAIYQANPPAPAGPTSGAPPAPPTAPQPAFSYALPSEGNH 228

BLAST of Tan0004102 vs. ExPASy Swiss-Prot
Match: C5XZK6 (U1 small nuclear ribonucleoprotein C-1 OS=Sorghum bicolor OX=4558 GN=Sb04g009880 PE=3 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 3.5e-50
Identity = 138/237 (58.23%), Positives = 150/237 (63.29%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR+YYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLG-----QRPRLPVLPTPVMPGG---APG--LLPGIRPPVLPRPV 120
           AAAF Q GA +NQH+L       RPRLP+LPTP MP G   APG  L+PG+RPP+L  P 
Sbjct: 61  AAAF-QAGAPFNQHMLTFPGAVARPRLPILPTPGMPHGFPQAPGAPLMPGVRPPIL--PA 120

Query: 121 PGAPGYLPAPTMPPMMAPPGAP---MPGQVNLPARLPPPAPIPGSAPQ-----------P 180
           PG PGY   P  PP M  PGAP   MP     P  +P P   PGS P            P
Sbjct: 121 PGIPGY---PGGPPTMLQPGAPPGSMPQPGAPPGSMPQPGAPPGSMPMQMAPLPRPPTLP 180

Query: 181 SSTNGAP------LAAPP-MYQANPAAPG---SGGYDSFTT-------MAQPSESNH 197
             T+G P       AAPP +YQ NP AP    SG   +  T        AQPSE NH
Sbjct: 181 PPTSGVPGAPIPNSAAPPAIYQTNPPAPAGPTSGAPPAPPTAPQPAFSYAQPSEGNH 231

BLAST of Tan0004102 vs. ExPASy Swiss-Prot
Match: Q56XE4 (U1 small nuclear ribonucleoprotein C OS=Arabidopsis thaliana OX=3702 GN=At4g03120 PE=2 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 1.3e-49
Identity = 121/197 (61.42%), Positives = 136/197 (69.04%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR YYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRIYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
              +QQVGA +NQH+L  RPR P++   + PG  P    G+RPPVLPRP+    GY+P P
Sbjct: 61  TGGYQQVGAVFNQHMLA-RPRPPMM---LPPGSMP---MGMRPPVLPRPMMPPQGYMPPP 120

Query: 121 TMPPMMAPPGAPM-PGQVNLPARLPPPAPIPGS-------APQPSSTNGAP-----LAAP 180
            +P MMAPPGAP+ P   N   R P  APIPG        AP P    G P     L  P
Sbjct: 121 GVPQMMAPPGAPLPPPPQNGILRPPGMAPIPGQGGGPPGMAPIPGQGGGPPPNYNGLPPP 180

Query: 181 PMYQANPAAPGSGGYDS 185
           P Y  NPAAP SG +++
Sbjct: 181 PPYHTNPAAPPSGNFNN 190

BLAST of Tan0004102 vs. ExPASy Swiss-Prot
Match: A8XW44 (U1 small nuclear ribonucleoprotein C OS=Caenorhabditis briggsae OX=6238 GN=CBG19656 PE=3 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 6.4e-20
Identity = 67/147 (45.58%), Positives = 86/147 (58.50%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MP+YYCDYCDT+LTHDSPSVRK HN G KHK NVR +YQ++ E Q Q L+DQ       +
Sbjct: 1   MPKYYCDYCDTFLTHDSPSVRKTHNGGRKHKDNVRMFYQKWMEDQAQKLVDQ-----TAR 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAP-GYLPA 120
           A A  ++  A  +  +G  P  PV   P+M GG PG+     P + PRP PG P G+  A
Sbjct: 61  AFATNRMHGAVPRTTMGMAPVPPVGHHPMM-GGPPGM-----PMMAPRPFPGPPVGFPGA 120

Query: 121 PTMPPMMAPPGAPMPGQVNLPARLPPP 147
           P + P   PP   + G   +P  +P P
Sbjct: 121 PGLAPFPGPP-MGLAGPPGMPPMMPRP 135

BLAST of Tan0004102 vs. NCBI nr
Match: XP_038899936.1 (U1 small nuclear ribonucleoprotein C isoform X1 [Benincasa hispida])

HSP 1 Score: 369.8 bits (948), Expect = 1.5e-98
Identity = 186/196 (94.90%), Positives = 191/196 (97.45%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPG APGL+PGIRPPVLPRPVPGAPGYLPAP
Sbjct: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGAAPGLMPGIRPPVLPRPVPGAPGYLPAP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGAPMPGQVN+PAR PPPAP+PGSAPQPSSTNGAPLA P MYQANPAAPGSG
Sbjct: 121 TMPPMMAPPGAPMPGQVNIPARPPPPAPVPGSAPQPSSTNGAPLAPPSMYQANPAAPGSG 180

Query: 181 GYDSFTTMAQPSESNH 197
           GY+SFTTMAQPSE NH
Sbjct: 181 GYESFTTMAQPSEPNH 196

BLAST of Tan0004102 vs. NCBI nr
Match: XP_023004664.1 (U1 small nuclear ribonucleoprotein C [Cucurbita maxima])

HSP 1 Score: 360.1 bits (923), Expect = 1.2e-95
Identity = 183/195 (93.85%), Positives = 187/195 (95.90%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGL+PGIRPPV PRPVPGAPGYLP P
Sbjct: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLMPGIRPPVFPRPVPGAPGYLPNP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGA MPGQVN+P+R PPPAPIPGS  QPSSTNGAPL APP YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGALMPGQVNVPSRPPPPAPIPGSTQQPSSTNGAPLTAPPTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQPSESN 196
           GYDSFTTMAQPSESN
Sbjct: 181 GYDSFTTMAQPSESN 195

BLAST of Tan0004102 vs. NCBI nr
Match: XP_004148886.1 (U1 small nuclear ribonucleoprotein C [Cucumis sativus] >KGN44835.1 hypothetical protein Csa_015811 [Cucumis sativus])

HSP 1 Score: 360.1 bits (923), Expect = 1.2e-95
Identity = 182/197 (92.39%), Positives = 191/197 (96.95%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAA+NQHLLGQRPRLPVLPTPVMPG APGL+PGIRPPVLPRP+PGAPGYLP P
Sbjct: 61  AAAFQQVGAAFNQHLLGQRPRLPVLPTPVMPGAAPGLMPGIRPPVLPRPIPGAPGYLPTP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGAP+PGQVN+P+R PPPAP+PGSAPQPSSTNGAPLAAP  YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGAPIPGQVNIPSRPPPPAPLPGSAPQPSSTNGAPLAAPSTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQP-SESNH 197
           GYDSFT+MAQP SESNH
Sbjct: 181 GYDSFTSMAQPSSESNH 197

BLAST of Tan0004102 vs. NCBI nr
Match: XP_008451425.1 (PREDICTED: U1 small nuclear ribonucleoprotein C [Cucumis melo] >KAA0057542.1 U1 small nuclear ribonucleoprotein C [Cucumis melo var. makuwa] >TYK18264.1 U1 small nuclear ribonucleoprotein C [Cucumis melo var. makuwa])

HSP 1 Score: 359.4 bits (921), Expect = 2.0e-95
Identity = 182/197 (92.39%), Positives = 191/197 (96.95%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAA+NQHLLGQRPRLPVLPTPV+PG APGL+PGIRPPVLPRP+PGAPGYLP P
Sbjct: 61  AAAFQQVGAAFNQHLLGQRPRLPVLPTPVIPGAAPGLMPGIRPPVLPRPIPGAPGYLPTP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGAP+PGQVN+P+R PPPAPIPGSAPQPSSTNGAPLAAP  YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGAPIPGQVNIPSRPPPPAPIPGSAPQPSSTNGAPLAAPSTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQP-SESNH 197
           GYDSFT+MAQP SESNH
Sbjct: 181 GYDSFTSMAQPSSESNH 197

BLAST of Tan0004102 vs. NCBI nr
Match: KAG6593155.1 (U1 small nuclear ribonucleoprotein C, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 358.6 bits (919), Expect = 3.4e-95
Identity = 182/195 (93.33%), Positives = 186/195 (95.38%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGG PGL+PGIRPPV PRPVPGAPGYLP P
Sbjct: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGVPGLMPGIRPPVFPRPVPGAPGYLPNP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGA MPGQVN+P+R PPPAPIPGS PQPSSTNGAPL AP  YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGALMPGQVNVPSRPPPPAPIPGSTPQPSSTNGAPLTAPQTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQPSESN 196
           GYDSFTTMAQPSESN
Sbjct: 181 GYDSFTTMAQPSESN 195

BLAST of Tan0004102 vs. ExPASy TrEMBL
Match: A0A6J1KWY5 (U1 small nuclear ribonucleoprotein C OS=Cucurbita maxima OX=3661 GN=LOC111497890 PE=3 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 5.7e-96
Identity = 183/195 (93.85%), Positives = 187/195 (95.90%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGL+PGIRPPV PRPVPGAPGYLP P
Sbjct: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLMPGIRPPVFPRPVPGAPGYLPNP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGA MPGQVN+P+R PPPAPIPGS  QPSSTNGAPL APP YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGALMPGQVNVPSRPPPPAPIPGSTQQPSSTNGAPLTAPPTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQPSESN 196
           GYDSFTTMAQPSESN
Sbjct: 181 GYDSFTTMAQPSESN 195

BLAST of Tan0004102 vs. ExPASy TrEMBL
Match: A0A0A0K515 (U1 small nuclear ribonucleoprotein C OS=Cucumis sativus OX=3659 GN=Csa_7G390140 PE=3 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 5.7e-96
Identity = 182/197 (92.39%), Positives = 191/197 (96.95%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAA+NQHLLGQRPRLPVLPTPVMPG APGL+PGIRPPVLPRP+PGAPGYLP P
Sbjct: 61  AAAFQQVGAAFNQHLLGQRPRLPVLPTPVMPGAAPGLMPGIRPPVLPRPIPGAPGYLPTP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGAP+PGQVN+P+R PPPAP+PGSAPQPSSTNGAPLAAP  YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGAPIPGQVNIPSRPPPPAPLPGSAPQPSSTNGAPLAAPSTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQP-SESNH 197
           GYDSFT+MAQP SESNH
Sbjct: 181 GYDSFTSMAQPSSESNH 197

BLAST of Tan0004102 vs. ExPASy TrEMBL
Match: A0A5A7UNW2 (U1 small nuclear ribonucleoprotein C OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold411G002060 PE=3 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 9.7e-96
Identity = 182/197 (92.39%), Positives = 191/197 (96.95%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAA+NQHLLGQRPRLPVLPTPV+PG APGL+PGIRPPVLPRP+PGAPGYLP P
Sbjct: 61  AAAFQQVGAAFNQHLLGQRPRLPVLPTPVIPGAAPGLMPGIRPPVLPRPIPGAPGYLPTP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGAP+PGQVN+P+R PPPAPIPGSAPQPSSTNGAPLAAP  YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGAPIPGQVNIPSRPPPPAPIPGSAPQPSSTNGAPLAAPSTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQP-SESNH 197
           GYDSFT+MAQP SESNH
Sbjct: 181 GYDSFTSMAQPSSESNH 197

BLAST of Tan0004102 vs. ExPASy TrEMBL
Match: A0A1S3BS92 (U1 small nuclear ribonucleoprotein C OS=Cucumis melo OX=3656 GN=LOC103492724 PE=3 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 9.7e-96
Identity = 182/197 (92.39%), Positives = 191/197 (96.95%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAA+NQHLLGQRPRLPVLPTPV+PG APGL+PGIRPPVLPRP+PGAPGYLP P
Sbjct: 61  AAAFQQVGAAFNQHLLGQRPRLPVLPTPVIPGAAPGLMPGIRPPVLPRPIPGAPGYLPTP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGAP+PGQVN+P+R PPPAPIPGSAPQPSSTNGAPLAAP  YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGAPIPGQVNIPSRPPPPAPIPGSAPQPSSTNGAPLAAPSTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQP-SESNH 197
           GYDSFT+MAQP SESNH
Sbjct: 181 GYDSFTSMAQPSSESNH 197

BLAST of Tan0004102 vs. ExPASy TrEMBL
Match: A0A6J1H7P8 (U1 small nuclear ribonucleoprotein C OS=Cucurbita moschata OX=3662 GN=LOC111460839 PE=3 SV=1)

HSP 1 Score: 355.1 bits (910), Expect = 1.8e-94
Identity = 181/195 (92.82%), Positives = 186/195 (95.38%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
           AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGL+PGIRPPV PRP+PGAPGYLP P
Sbjct: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLMPGIRPPVFPRPLPGAPGYLPNP 120

Query: 121 TMPPMMAPPGAPMPGQVNLPARLPPPAPIPGSAPQPSSTNGAPLAAPPMYQANPAAPGSG 180
           TMPPMMAPPGA MPGQVN+P+R PPPAPIPGS PQ SSTNGAPL AP  YQANPAAPGSG
Sbjct: 121 TMPPMMAPPGALMPGQVNVPSRPPPPAPIPGSTPQLSSTNGAPLTAPQTYQANPAAPGSG 180

Query: 181 GYDSFTTMAQPSESN 196
           GYDSFTTMAQPSESN
Sbjct: 181 GYDSFTTMAQPSESN 195

BLAST of Tan0004102 vs. TAIR 10
Match: AT4G03120.1 (C2H2 and C2HC zinc fingers superfamily protein )

HSP 1 Score: 197.6 bits (501), Expect = 9.4e-51
Identity = 121/197 (61.42%), Positives = 136/197 (69.04%), Query Frame = 0

Query: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRSYYQQFEEQQTQNLIDQRIKEHLGQ 60
           MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVR YYQQFEEQQTQ+LIDQRIKEHLGQ
Sbjct: 1   MPRYYCDYCDTYLTHDSPSVRKQHNAGYKHKANVRIYYQQFEEQQTQSLIDQRIKEHLGQ 60

Query: 61  AAAFQQVGAAYNQHLLGQRPRLPVLPTPVMPGGAPGLLPGIRPPVLPRPVPGAPGYLPAP 120
              +QQVGA +NQH+L  RPR P++   + PG  P    G+RPPVLPRP+    GY+P P
Sbjct: 61  TGGYQQVGAVFNQHMLA-RPRPPMM---LPPGSMP---MGMRPPVLPRPMMPPQGYMPPP 120

Query: 121 TMPPMMAPPGAPM-PGQVNLPARLPPPAPIPGS-------APQPSSTNGAP-----LAAP 180
            +P MMAPPGAP+ P   N   R P  APIPG        AP P    G P     L  P
Sbjct: 121 GVPQMMAPPGAPLPPPPQNGILRPPGMAPIPGQGGGPPGMAPIPGQGGGPPPNYNGLPPP 180

Query: 181 PMYQANPAAPGSGGYDS 185
           P Y  NPAAP SG +++
Sbjct: 181 PPYHTNPAAPPSGNFNN 190

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
F6HQ26          6.3e-68  70.23         U1 small nuclear ribonucleoprotein C OS=Vitis vinifera OX=29760 GN=VIT_07s0104g0...
C5XYW4          2.0e-50  59.23         U1 small nuclear ribonucleoprotein C-2 OS=Sorghum bicolor OX=4558 GN=Sb04g028260...
C5XZK6          3.5e-50  58.23         U1 small nuclear ribonucleoprotein C-1 OS=Sorghum bicolor OX=4558 GN=Sb04g009880...
Q56XE4          1.3e-49  61.42         U1 small nuclear ribonucleoprotein C OS=Arabidopsis thaliana OX=3702 GN=At4g0312...
A8XW44          6.4e-20  45.58         U1 small nuclear ribonucleoprotein C OS=Caenorhabditis briggsae OX=6238 GN=CBG19...

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038899936.1  1.5e-98  94.90         U1 small nuclear ribonucleoprotein C isoform X1 [Benincasa hispida]
XP_023004664.1  1.2e-95  93.85         U1 small nuclear ribonucleoprotein C [Cucurbita maxima]
XP_004148886.1  1.2e-95  92.39         U1 small nuclear ribonucleoprotein C [Cucumis sativus] >KGN44835.1 hypothetical ...
XP_008451425.1  2.0e-95  92.39         PREDICTED: U1 small nuclear ribonucleoprotein C [Cucumis melo] >KAA0057542.1 U1 ...
KAG6593155.1    3.4e-95  93.33         U1 small nuclear ribonucleoprotein C, partial [Cucurbita argyrosperma subsp. sor...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1KWY5      5.7e-96  93.85         U1 small nuclear ribonucleoprotein C OS=Cucurbita maxima OX=3661 GN=LOC111497890...
A0A0A0K515      5.7e-96  92.39         U1 small nuclear ribonucleoprotein C OS=Cucumis sativus OX=3659 GN=Csa_7G390140 ...
A0A5A7UNW2      9.7e-96  92.39         U1 small nuclear ribonucleoprotein C OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A1S3BS92      9.7e-96  92.39         U1 small nuclear ribonucleoprotein C OS=Cucumis melo OX=3656 GN=LOC103492724 PE=...
A0A6J1H7P8      1.8e-94  92.82         U1 small nuclear ribonucleoprotein C OS=Cucurbita moschata OX=3662 GN=LOC1114608...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G03120.1     9.4e-51  61.42         C2H2 and C2HC zinc fingers superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                          Source       Source Term    Source Description                    Alignment
IPR003604  Matrin/U1-C-like, C2H2-type zinc finger  SMART        SM00451        ZnF_U1_5                              coord: 1..37; e-value: 1.4E-12; score: 57.8
None       No IPR available                         GENE3D       3.30.160.60    Classic Zinc Finger                   coord: 1..61; e-value: 1.5E-29; score: 103.8
None       No IPR available                         MOBIDB_LITE  mobidb-lite    disorder_prediction                   coord: 127..153
None       No IPR available                         MOBIDB_LITE  mobidb-lite    disorder_prediction                   coord: 127..196
None       No IPR available                         PANTHER      PTHR31148:SF2  U1 SMALL NUCLEAR RIBONUCLEOPROTEIN C  coord: 1..180
IPR017340  U1 small nuclear ribonucleoprotein C     PIRSF        PIRSF037969    U1-C                                  coord: 1..196; e-value: 2.8E-39; score: 134.0
IPR017340  U1 small nuclear ribonucleoprotein C     PANTHER      PTHR31148      U1 SMALL NUCLEAR RIBONUCLEOPROTEIN C  coord: 1..180
IPR017340  U1 small nuclear ribonucleoprotein C     HAMAP        MF_03153       U1_C                                  coord: 1..169; score: 11.782591
IPR013085  U1-C, C2H2-type zinc finger              PFAM         PF06220        zf-U1                                 coord: 1..38; e-value: 5.3E-22; score: 77.3
IPR000690  Matrin/U1-C, C2H2-type zinc finger       PROSITE      PS50171        ZF_MATRIN                             coord: 4..36; score: 12.522273
IPR036236  Zinc finger C2H2 superfamily             SUPERFAMILY  57667          beta-beta-alpha zinc fingers          coord: 1..55

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0004102.1  Tan0004102.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000395 mRNA 5'-splice site recognition
biological_process GO:0000387 spliceosomal snRNP assembly
biological_process GO:0000398 mRNA splicing, via spliceosome
cellular_component GO:0000243 commitment complex
cellular_component GO:0005685 U1 snRNP
cellular_component GO:0071004 U2-type prespliceosome
cellular_component GO:0005634 nucleus
molecular_function GO:0003729 mRNA binding
molecular_function GO:0030627 pre-mRNA 5'-splice site binding
molecular_function GO:0030619 U1 snRNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding