Tan0005219 (gene) Snake gourd v1

Overview
Name: Tan0005219
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA-directed RNA polymerase II subunit 1-like
Location: LG10: 1141227 .. 1143075 (-)
RNA-Seq Expression: Tan0005219
Synteny: Tan0005219
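
The Location field can be turned into a span length with a little arithmetic. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), and the helper names `parse_location` and `span_length` are hypothetical, not part of any database API.

```python
import re

def parse_location(text):
    """Split a location such as 'LG10: 1141227 .. 1143075 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("LG10: 1141227 .. 1143075 (-)")
print(seqid, strand, span_length(start, end))  # LG10 - 1849
```

Under that assumption the gene spans 1849 bases on the minus strand of LG10.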
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TACAAAGACAAGGCTTTTCAATATATCTCTGCCTCTAATTTTTATATTCTATTTGGATTTTCATTTTCTTAGCTTTTCAAAACAATTATGTCAAATTTTCCTCGCTTTGGCCGTACACGTCAACGTCCTTCTGTGGTGGCGCCGCCAGTTCTCGCCGCCGCACAGCCTGCTGCAGAACCGAAGCCTGAATTTCTACCGATTGCTCCGGCCACCCAAACTCTTCAACCTTTAGACCCCAACCTGGCGGTGCCGGCGCCGGCTTCTCCTCTGATTGTGTCCCCTCGCCAGCTTCCCTCTCCGGTGAAGAAAGCCATGTCACCCTTTGCTTCGCCGAGATATGGTGGTCCTGCCACACGTGTGCCCAGCCCGCCGCCGGCGAGGGTCACACGATCGCCTCCGATTTCGCCTGCAAAGAAATATCCCGAAAGGAGAATTGGCCAAACAAGCCCACCTCACTCGCCGGCAAAGTCCCGGCGAACGACTCCGCCGCCTTCTCCTCTCGCTCTGCCTCGTACCCAGGTCACCGCCGTAAATGGGACCACGACTCAGCCCAGGTATTAAAGTTCAATTTTTGTTTTTCTTTCATGTTTTGAGTACGAGTGGATAAATTACTTGGTAATAATATCTTAATATAAATTCGATCTATTTCTCTCTTCATTTAAACGCCGGACATGTGGTTAGTCTAATAATTTCTCGTAAAATTGAGATTTAATACTTTAATATATGTCAAAGATGTAATATTTTAATTCATTAAATCTATGCTCAACATGATATGTTCAATTTAATATAGATGAAATTGAAAATTTGGTCACTAAAATTGTTAGGAGTTTAGATTTAGTTTTAATTTGGTCTTGGTAATTTTAAAGTTTTAAACGGGTTCATTTCATCTATTTTAATTTTTAAAATTAATTGTCATTGATTTAAGCTTTGTTTCAATAATTCGTTTCTAAATTAAGTTTACCAACTCTATTGTCACATTAAAAAAAAAAAGGTCATTATGATAACAAAATGAATCATTTGTAAAGTTTAGTTCAATTATAGAAACTAAATTAAAATTAAACTCAAATTATAAGTAAAATAGAATTTTAGAATTTAAACAACGTAGACTAAAATTTTTGGGATTTCAACGTCTTAATTTCAGGCAAATTAAAGGTATATCTCTTCACTAGAACCAAACCTTATTAAAAAAAATGAATTATTTATTCAAATTTTCAATTCTAGGATTCAGCCGGAGGTGGAGCAGAAAAACATTGTATACAACAAGACCGTCGAGAGGCCGGCGAAGTCTGATCGGCCGTCGGAGTACGGATCCGGCAAGCCACCGCAGAGGGAGCAGGCGGAGGTTATAAACCTCACCGGACATAACGTCGGCGCCGTCATGGAAGTTAATCAGTCCTCCCCCGGCCACCGTTTGGGGGGAGAAACTACCAAAAAGAAGGAAACAGAAGGCGGCGGCGGCTGCGGCGTCCGCCATGGAAATGATCAGAAGAAAACGGAGGCAAAGAAGGAACTCCCGATGACGGCATTTATGAACAGCAACTTTCAGAGTGTAAACAATTCGGTTCTGTACGACTCGTCGTGCAACCACCGTGATCCCGGTCTGCATCTCGCGTTTTCCGATGCGGCGGACGGCGACGGAGCCACTGTTGACGGCCGTAAGAATTACAAGCCATAGAGATAATGGCTGGAGAACTATTTAATCATAAGAAATATGCCATTTACTTGGTGTTTAAAAAAAGCCATTTACTCCTCCTTTTTTTTTTGAAATAAAAAAACCATTTACTTGGAATCGTATTTTAATCGATATGTAAGCATTAATAAAATATATATTATGATTAATTTCCTATTT

mRNA sequence

TACAAAGACAAGGCTTTTCAATATATCTCTGCCTCTAATTTTTATATTCTATTTGGATTTTCATTTTCTTAGCTTTTCAAAACAATTATGTCAAATTTTCCTCGCTTTGGCCGTACACGTCAACGTCCTTCTGTGGTGGCGCCGCCAGTTCTCGCCGCCGCACAGCCTGCTGCAGAACCGAAGCCTGAATTTCTACCGATTGCTCCGGCCACCCAAACTCTTCAACCTTTAGACCCCAACCTGGCGGTGCCGGCGCCGGCTTCTCCTCTGATTGTGTCCCCTCGCCAGCTTCCCTCTCCGGTGAAGAAAGCCATGTCACCCTTTGCTTCGCCGAGATATGGTGGTCCTGCCACACGTGTGCCCAGCCCGCCGCCGGCGAGGGTCACACGATCGCCTCCGATTTCGCCTGCAAAGAAATATCCCGAAAGGAGAATTGGCCAAACAAGCCCACCTCACTCGCCGGCAAAGTCCCGGCGAACGACTCCGCCGCCTTCTCCTCTCGCTCTGCCTCGTACCCAGGTCACCGCCGTAAATGGGACCACGACTCAGCCCAGGATTCAGCCGGAGGTGGAGCAGAAAAACATTGTATACAACAAGACCGTCGAGAGGCCGGCGAAGTCTGATCGGCCGTCGGAGTACGGATCCGGCAAGCCACCGCAGAGGGAGCAGGCGGAGGTTATAAACCTCACCGGACATAACGTCGGCGCCGTCATGGAAGTTAATCAGTCCTCCCCCGGCCACCGTTTGGGGGGAGAAACTACCAAAAAGAAGGAAACAGAAGGCGGCGGCGGCTGCGGCGTCCGCCATGGAAATGATCAGAAGAAAACGGAGGCAAAGAAGGAACTCCCGATGACGGCATTTATGAACAGCAACTTTCAGAGTGTAAACAATTCGGTTCTGTACGACTCGTCGTGCAACCACCGTGATCCCGGTCTGCATCTCGCGTTTTCCGATGCGGCGGACGGCGACGGAGCCACTGTTGACGGCCGTAAGAATTACAAGCCATAGAGATAATGGCTGGAGAACTATTTAATCATAAGAAATATGCCATTTACTTGGTGTTTAAAAAAAGCCATTTACTCCTCCTTTTTTTTTTGAAATAAAAAAACCATTTACTTGGAATCGTATTTTAATCGATATGTAAGCATTAATAAAATATATATTATGATTAATTTCCTATTT

Coding sequence (CDS)

ATGTCAAATTTTCCTCGCTTTGGCCGTACACGTCAACGTCCTTCTGTGGTGGCGCCGCCAGTTCTCGCCGCCGCACAGCCTGCTGCAGAACCGAAGCCTGAATTTCTACCGATTGCTCCGGCCACCCAAACTCTTCAACCTTTAGACCCCAACCTGGCGGTGCCGGCGCCGGCTTCTCCTCTGATTGTGTCCCCTCGCCAGCTTCCCTCTCCGGTGAAGAAAGCCATGTCACCCTTTGCTTCGCCGAGATATGGTGGTCCTGCCACACGTGTGCCCAGCCCGCCGCCGGCGAGGGTCACACGATCGCCTCCGATTTCGCCTGCAAAGAAATATCCCGAAAGGAGAATTGGCCAAACAAGCCCACCTCACTCGCCGGCAAAGTCCCGGCGAACGACTCCGCCGCCTTCTCCTCTCGCTCTGCCTCGTACCCAGGTCACCGCCGTAAATGGGACCACGACTCAGCCCAGGATTCAGCCGGAGGTGGAGCAGAAAAACATTGTATACAACAAGACCGTCGAGAGGCCGGCGAAGTCTGATCGGCCGTCGGAGTACGGATCCGGCAAGCCACCGCAGAGGGAGCAGGCGGAGGTTATAAACCTCACCGGACATAACGTCGGCGCCGTCATGGAAGTTAATCAGTCCTCCCCCGGCCACCGTTTGGGGGGAGAAACTACCAAAAAGAAGGAAACAGAAGGCGGCGGCGGCTGCGGCGTCCGCCATGGAAATGATCAGAAGAAAACGGAGGCAAAGAAGGAACTCCCGATGACGGCATTTATGAACAGCAACTTTCAGAGTGTAAACAATTCGGTTCTGTACGACTCGTCGTGCAACCACCGTGATCCCGGTCTGCATCTCGCGTTTTCCGATGCGGCGGACGGCGACGGAGCCACTGTTGACGGCCGTAAGAATTACAAGCCATAG

Protein sequence

MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASPLIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTSPPHSPAKSRRTTPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSDRPSEYGSGKPPQREQAEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGVRHGNDQKKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDGATVDGRKNYKP
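
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies, the following translates a CDS fragment codon by codon; the `translate` helper is hypothetical and written only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and last three codons of the CDS listed above.
print(translate("ATGTCAAATTTTCCTCGCTTTGGCCGT"))  # MSNFPRFGR
print(translate("AAGCCATAG"))                    # KP
```

The two fragments reproduce the start (MSNFPRFGR) and the end (KP) of the listed protein sequence.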
Homology
BLAST of Tan0005219 vs. NCBI nr
Match: XP_038879417.1 (proline-rich receptor-like protein kinase PERK8 [Benincasa hispida])

HSP 1 Score: 367.1 bits (941), Expect = 1.5e-97
Identity = 205/309 (66.34%), Positives = 235/309 (76.05%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           M+N PR GRTRQRPS VAPP+LAA QP AEPKPE  P AP +   QP++P    PAPASP
Sbjct: 1   MANLPRIGRTRQRPSAVAPPLLAAVQPTAEPKPEISPFAPTSIQTQPIEP--TTPAPASP 60

Query: 61  LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
           L  SPR + SP KKA SPFASP+YG   TRV SPP A+ TRSPP SP  KY E R G+T+
Sbjct: 61  LRQSPRLITSPPKKATSPFASPKYGDSLTRVDSPPAAKATRSPPDSPTNKYLETRNGETT 120

Query: 121 PPHSPAKSRRT-TPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
           PP SPAKSRRT TPP SPL LPRT V + + TT  PR QP VE K IVYNK VE+P K+D
Sbjct: 121 PPLSPAKSRRTKTPPLSPLTLPRTPVISWDETTALPRTQPVVETKGIVYNKAVEKPTKTD 180

Query: 181 RPSEYGSGKPPQREQ--AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
           RPSEYGSGKP Q++Q  AEVINL GHNVGAVME+N+SS G+RLGGET K KET+GG   G
Sbjct: 181 RPSEYGSGKPHQKQQATAEVINLNGHNVGAVMEINKSSDGYRLGGETVKNKETKGG---G 240

Query: 241 VRHGNDQKKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDGAT 300
           V HG+ +K   AK   P+TAFMN+NFQS+NNS+LYDSSCNH DPGLHL+  ++ DGDGAT
Sbjct: 241 VHHGHKEKNKGAKIVPPVTAFMNTNFQSINNSILYDSSCNHHDPGLHLSLPESVDGDGAT 300

Query: 301 VDGRKNYKP 307
           V G K+YKP
Sbjct: 301 VYGHKSYKP 304
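
The Identity and Positives percentages in each HSP header are ratios over the alignment length (gaps included). A minimal sketch of that arithmetic, using the counts reported for this first hit; the `percent` helper is hypothetical.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the HSP header above: 205 identical and
# 235 positive positions over an alignment of 309 columns.
print(percent(205, 309))  # 66.34
print(percent(235, 309))  # 76.05
```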

BLAST of Tan0005219 vs. NCBI nr
Match: KAG6597060.1 (hypothetical protein SDJN03_10240, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 349.0 bits (894), Expect = 4.2e-92
Identity = 198/310 (63.87%), Positives = 230/310 (74.19%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           MSNFPRFG +RQRPS++APPVL  + PAA                 PLDP      P  P
Sbjct: 1   MSNFPRFGLSRQRPSLMAPPVLPTSLPAA-----------------PLDP----IEPPKP 60

Query: 61  LIVS----PRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRI 120
           +++S    PR L SP +K  SP ASP YG   TRVPSPPPA+   SPP+SPAKKYP+R I
Sbjct: 61  VLLSRKSPPRWLTSPEEKPTSPIASPEYGSSVTRVPSPPPAKDQLSPPVSPAKKYPDRWI 120

Query: 121 GQTSPPHSPAKSRRTTPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPA 180
            QTSP  SP++S RT+PPP P ALP TQ TAVNGTTTQP+IQPEVE+K+ VYNKTVE+PA
Sbjct: 121 AQTSPLQSPSRSLRTSPPPPPFALPPTQFTAVNGTTTQPKIQPEVEKKSFVYNKTVEKPA 180

Query: 181 KSDRPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGG 240
           KSD  SEYGSGKP ++++  E INL GHNVGAVME+++S+ GHRLGGET +         
Sbjct: 181 KSDWTSEYGSGKPHEKQKVTEAINLAGHNVGAVMEIDKSTVGHRLGGETVRG-------- 240

Query: 241 CGVRHGNDQKKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDG 300
            GVR GN++KK + KKE+PMTAFMNSNFQSVNNSVLY SSCNHRDPGLHL F+DAADGDG
Sbjct: 241 -GVRDGNEEKKKKEKKEVPMTAFMNSNFQSVNNSVLYGSSCNHRDPGLHLKFADAADGDG 280

Query: 301 ATVDGRKNYK 306
           ATVDGRKNYK
Sbjct: 301 ATVDGRKNYK 280

BLAST of Tan0005219 vs. NCBI nr
Match: XP_022951357.1 (sulfated surface glycoprotein 185-like [Cucurbita moschata])

HSP 1 Score: 344.7 bits (883), Expect = 8.0e-91
Identity = 200/315 (63.49%), Positives = 231/315 (73.33%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           MSNFPRFG +RQRPS+ APPVL    PAAE KPE  P    +Q+LQ              
Sbjct: 1   MSNFPRFGLSRQRPSMAAPPVLPTTLPAAELKPETQPFYKRSQSLQ-------------- 60

Query: 61  LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
                     P +K  SP ASP+YG   TRVPSPPP +   SPP+SPAKKYP+ R+ QTS
Sbjct: 61  ----------PAEKPTSPIASPKYGPSVTRVPSPPPPKDELSPPVSPAKKYPD-RLSQTS 120

Query: 121 PPHSPAKSRRTT--PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKS 180
           P  SP++S RT+  PPP PLALP TQ TAVN TTTQPRIQPEVE+K+IVYNKTVE+  KS
Sbjct: 121 PLQSPSRSLRTSPPPPPPPLALPPTQFTAVNETTTQPRIQPEVEKKSIVYNKTVEKLVKS 180

Query: 181 DRPSEYGSGKPPQREQA-EVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
           DRPSEYGSGKP ++++A E INL GHNVGAVME+++SS GHRLGGET +K ETEGGG   
Sbjct: 181 DRPSEYGSGKPYEKQKAVESINLAGHNVGAVMEIDKSSVGHRLGGETVRKNETEGGG--- 240

Query: 241 VRHGNDQKKTEAKKE-------LPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDA 300
            R GN++KK E KK+       +PMTAFMNSNFQSVNNSVLYDSSC+HRDPGLHL F+DA
Sbjct: 241 -RDGNEEKKKEEKKKKEKKKKNVPMTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLKFADA 286

Query: 301 ADGDGATVDGRKNYK 306
           ADGDGA VDGRK+YK
Sbjct: 301 ADGDGAAVDGRKSYK 286

BLAST of Tan0005219 vs. NCBI nr
Match: XP_022974816.1 (wiskott-Aldrich syndrome protein family member 2-like [Cucurbita maxima] >XP_022975275.1 wiskott-Aldrich syndrome protein family member 2-like [Cucurbita maxima])

HSP 1 Score: 339.0 bits (868), Expect = 4.4e-89
Identity = 197/312 (63.14%), Positives = 226/312 (72.44%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           MSNFPRFGR RQRPSV APPVL    PAAE KPE LP    +QTLQ              
Sbjct: 1   MSNFPRFGRPRQRPSVAAPPVLPTTLPAAEHKPETLPFYRTSQTLQ-------------- 60

Query: 61  LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
                     P +K  SP ASP+YG   T VPSPPPA+   SPP+SPAKKYP+ R+ QTS
Sbjct: 61  ----------PAEKPTSPIASPKYGPSVTLVPSPPPAKDELSPPVSPAKKYPD-RLSQTS 120

Query: 121 PPHSPAKSRRTT-PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
           P  SP++S RT+ PPP PLALP T  TA +G TTQPRIQ EVE+K+IVYNKTVE+P KSD
Sbjct: 121 PLQSPSRSLRTSPPPPPPLALPPTHFTAKDGNTTQPRIQTEVEKKSIVYNKTVEKPVKSD 180

Query: 181 RPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGV 240
           RP EYGSGK  ++++ AE INL GHNVGAVME+++ S  HRLGGET +K +TEGG   G 
Sbjct: 181 RPLEYGSGKSHEKQKTAESINLAGHNVGAVMEIDKLSASHRLGGETVRKNKTEGGD--GG 240

Query: 241 RHGNDQKKTEAKKE-----LPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADG 300
           R GN++KK + KK+     +PMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHL F+DAADG
Sbjct: 241 RDGNEEKKKKEKKDKKKKKVPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLKFADAADG 285

Query: 301 DGATVDGRKNYK 306
           DGA VDGRKNYK
Sbjct: 301 DGAAVDGRKNYK 285

BLAST of Tan0005219 vs. NCBI nr
Match: KAG6597059.1 (hypothetical protein SDJN03_10239, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 337.8 bits (865), Expect = 9.7e-89
Identity = 200/325 (61.54%), Positives = 229/325 (70.46%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           MSNFPRFG +RQRPS+ APPVL    PAAE KPE  P    +Q+LQ              
Sbjct: 1   MSNFPRFGLSRQRPSMAAPPVLPTTLPAAELKPETQPFYKRSQSLQ-------------- 60

Query: 61  LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
                     P +K  SP ASP+YG   TRVPSPPP +   SPP+SPAKKYP+ R+ QTS
Sbjct: 61  ----------PAEKPTSPIASPKYGPSVTRVPSPPPPKDELSPPVSPAKKYPD-RLSQTS 120

Query: 121 PPHSPAKSRRTT-PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
           P  SP++S RT+ PPP PLALP T  TAVN TTTQPRIQPEVE+K+IVYNKTVERP KSD
Sbjct: 121 PLQSPSRSLRTSPPPPPPLALPPTHFTAVNRTTTQPRIQPEVEKKSIVYNKTVERPVKSD 180

Query: 181 RPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGV 240
           R SEYGSGKP ++++ AE INL GHNVGAVME+++SS GHRLGGET +K ETEGG     
Sbjct: 181 RLSEYGSGKPHEKQKAAESINLAGHNVGAVMEIDKSSVGHRLGGETVRKNETEGGD--VG 240

Query: 241 RHGNDQKKTEAK------------------KELPMTAFMNSNFQSVNNSVLYDSSCNHRD 300
           R GN++KK E K                  K++PMTAFMNSNFQSVNNSVLYDSSC+HRD
Sbjct: 241 RDGNEEKKKEEKKKEEKKKKEKKEKKETKEKKVPMTAFMNSNFQSVNNSVLYDSSCSHRD 298

Query: 301 PGLHLAFSDAADGDGATVDGRKNYK 306
           PGLHL F+DAADGDGA VDGRKNYK
Sbjct: 301 PGLHLKFADAADGDGAAVDGRKNYK 298

BLAST of Tan0005219 vs. ExPASy TrEMBL
Match: A0A6J1GHE5 (sulfated surface glycoprotein 185-like OS=Cucurbita moschata OX=3662 GN=LOC111454206 PE=4 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 3.8e-91
Identity = 200/315 (63.49%), Positives = 231/315 (73.33%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           MSNFPRFG +RQRPS+ APPVL    PAAE KPE  P    +Q+LQ              
Sbjct: 1   MSNFPRFGLSRQRPSMAAPPVLPTTLPAAELKPETQPFYKRSQSLQ-------------- 60

Query: 61  LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
                     P +K  SP ASP+YG   TRVPSPPP +   SPP+SPAKKYP+ R+ QTS
Sbjct: 61  ----------PAEKPTSPIASPKYGPSVTRVPSPPPPKDELSPPVSPAKKYPD-RLSQTS 120

Query: 121 PPHSPAKSRRTT--PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKS 180
           P  SP++S RT+  PPP PLALP TQ TAVN TTTQPRIQPEVE+K+IVYNKTVE+  KS
Sbjct: 121 PLQSPSRSLRTSPPPPPPPLALPPTQFTAVNETTTQPRIQPEVEKKSIVYNKTVEKLVKS 180

Query: 181 DRPSEYGSGKPPQREQA-EVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
           DRPSEYGSGKP ++++A E INL GHNVGAVME+++SS GHRLGGET +K ETEGGG   
Sbjct: 181 DRPSEYGSGKPYEKQKAVESINLAGHNVGAVMEIDKSSVGHRLGGETVRKNETEGGG--- 240

Query: 241 VRHGNDQKKTEAKKE-------LPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDA 300
            R GN++KK E KK+       +PMTAFMNSNFQSVNNSVLYDSSC+HRDPGLHL F+DA
Sbjct: 241 -RDGNEEKKKEEKKKKEKKKKNVPMTAFMNSNFQSVNNSVLYDSSCSHRDPGLHLKFADA 286

Query: 301 ADGDGATVDGRKNYK 306
           ADGDGA VDGRK+YK
Sbjct: 301 ADGDGAAVDGRKSYK 286

BLAST of Tan0005219 vs. ExPASy TrEMBL
Match: A0A6J1ICG8 (wiskott-Aldrich syndrome protein family member 2-like OS=Cucurbita maxima OX=3661 GN=LOC111473600 PE=4 SV=1)

HSP 1 Score: 339.0 bits (868), Expect = 2.1e-89
Identity = 197/312 (63.14%), Positives = 226/312 (72.44%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           MSNFPRFGR RQRPSV APPVL    PAAE KPE LP    +QTLQ              
Sbjct: 1   MSNFPRFGRPRQRPSVAAPPVLPTTLPAAEHKPETLPFYRTSQTLQ-------------- 60

Query: 61  LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
                     P +K  SP ASP+YG   T VPSPPPA+   SPP+SPAKKYP+ R+ QTS
Sbjct: 61  ----------PAEKPTSPIASPKYGPSVTLVPSPPPAKDELSPPVSPAKKYPD-RLSQTS 120

Query: 121 PPHSPAKSRRTT-PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
           P  SP++S RT+ PPP PLALP T  TA +G TTQPRIQ EVE+K+IVYNKTVE+P KSD
Sbjct: 121 PLQSPSRSLRTSPPPPPPLALPPTHFTAKDGNTTQPRIQTEVEKKSIVYNKTVEKPVKSD 180

Query: 181 RPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGV 240
           RP EYGSGK  ++++ AE INL GHNVGAVME+++ S  HRLGGET +K +TEGG   G 
Sbjct: 181 RPLEYGSGKSHEKQKTAESINLAGHNVGAVMEIDKLSASHRLGGETVRKNKTEGGD--GG 240

Query: 241 RHGNDQKKTEAKKE-----LPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADG 300
           R GN++KK + KK+     +PMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHL F+DAADG
Sbjct: 241 RDGNEEKKKKEKKDKKKKKVPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLKFADAADG 285

Query: 301 DGATVDGRKNYK 306
           DGA VDGRKNYK
Sbjct: 301 DGAAVDGRKNYK 285

BLAST of Tan0005219 vs. ExPASy TrEMBL
Match: A0A5A7TZ24 (Zyxin-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00760 PE=4 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 1.5e-87
Identity = 201/310 (64.84%), Positives = 226/310 (72.90%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           M+N PRFGR RQR   V PPV AAAQPA EP+ + LP A    T          PAPASP
Sbjct: 1   MANLPRFGRARQRLPAVPPPVPAAAQPAVEPRYQILPFATTITT----------PAPASP 60

Query: 61  LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
              SPR L SP KKA SPFASP+YG   TR+   P A+ T SPP S   KY ERR G+T+
Sbjct: 61  RRESPRPLSSPPKKATSPFASPKYGDSRTRLDRSPAAKATMSPPDSVRDKYFERRNGETT 120

Query: 121 PPHSPAKSRRT-TPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
           PP SPAKSRR  TPP SPLALPR QV   NGTT QPR+QPEVE K IVYNKTVE+P+KS+
Sbjct: 121 PPLSPAKSRRAKTPPLSPLALPRNQVFTGNGTTAQPRVQPEVETKGIVYNKTVEKPSKSN 180

Query: 181 RPS-EYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
           R S EYGS K  Q++Q  EVI L GHNVGAVME+N+SS G+RLGGET KK ETE  G   
Sbjct: 181 RSSGEYGSSKSHQKKQKPEVIKLKGHNVGAVMEINKSSTGYRLGGETLKKNETEDVGDVH 240

Query: 241 VRHGNDQKKTEA-KKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDGA 300
             +G++ KKTE  KKE P+TAFMNSNFQSVNNS+L+DSSCNHRDPGLHLAF DA DGDGA
Sbjct: 241 -GYGHEDKKTETKKKEPPITAFMNSNFQSVNNSLLFDSSCNHRDPGLHLAFPDAVDGDGA 299

Query: 301 TVDGRKNYKP 307
            VDG+K+YKP
Sbjct: 301 IVDGQKSYKP 299

BLAST of Tan0005219 vs. ExPASy TrEMBL
Match: A0A1S3AV38 (zyxin-like OS=Cucumis melo OX=3656 GN=LOC103483158 PE=4 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 1.5e-87
Identity = 201/310 (64.84%), Positives = 226/310 (72.90%), Query Frame = 0

Query: 1   MSNFPRFGRTRQRPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASP 60
           M+N PRFGR RQR   V PPV AAAQPA EP+ + LP A    T          PAPASP
Sbjct: 1   MANLPRFGRARQRLPAVPPPVPAAAQPAVEPRYQILPFATTITT----------PAPASP 60

Query: 61  LIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTS 120
              SPR L SP KKA SPFASP+YG   TR+   P A+ T SPP S   KY ERR G+T+
Sbjct: 61  RRESPRPLSSPPKKATSPFASPKYGDSRTRLDRSPAAKATMSPPDSVRDKYFERRNGETT 120

Query: 121 PPHSPAKSRRT-TPPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKTVERPAKSD 180
           PP SPAKSRR  TPP SPLALPR QV   NGTT QPR+QPEVE K IVYNKTVE+P+KS+
Sbjct: 121 PPLSPAKSRRAKTPPLSPLALPRNQVFTGNGTTAQPRVQPEVETKGIVYNKTVEKPSKSN 180

Query: 181 RPS-EYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCG 240
           R S EYGS K  Q++Q  EVI L GHNVGAVME+N+SS G+RLGGET KK ETE  G   
Sbjct: 181 RSSGEYGSSKSHQKKQKPEVIKLKGHNVGAVMEINKSSTGYRLGGETLKKNETEDVGDVH 240

Query: 241 VRHGNDQKKTEA-KKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDAADGDGA 300
             +G++ KKTE  KKE P+TAFMNSNFQSVNNS+L+DSSCNHRDPGLHLAF DA DGDGA
Sbjct: 241 -GYGHEDKKTETKKKEPPITAFMNSNFQSVNNSLLFDSSCNHRDPGLHLAFPDAVDGDGA 299

Query: 301 TVDGRKNYKP 307
            VDG+K+YKP
Sbjct: 301 IVDGQKSYKP 299

BLAST of Tan0005219 vs. ExPASy TrEMBL
Match: A0A6J1GDX3 (wiskott-Aldrich syndrome protein family member 2-like OS=Cucurbita moschata OX=3662 GN=LOC111453294 PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 4.1e-85
Identity = 179/255 (70.20%), Positives = 202/255 (79.22%), Query Frame = 0

Query: 55  PAPASPLIVSPRQLPSPVKKAMSPFASPRYGGPATRVPSPPPAR-VTRSPPISPAKKYPE 114
           P PASPL  SP  LPSP KK MSP ASP+YG   TRVPSPPP +    SPP+SPAKKYP+
Sbjct: 27  PEPASPLKSSPHWLPSPAKKPMSPIASPKYGSSVTRVPSPPPPKDELLSPPVSPAKKYPD 86

Query: 115 RRIGQTSPPHSPAKSRRTT--PPPSPLALPRTQVTAVNGTTTQPRIQPEVEQKNIVYNKT 174
           R I QTSP  SP++S RT+  PPP PLALP TQ TAVNGTTTQP+IQPE+EQK+IVYNKT
Sbjct: 87  RWIAQTSPLQSPSRSLRTSPPPPPPPLALPPTQFTAVNGTTTQPKIQPEIEQKSIVYNKT 146

Query: 175 VERPAKSDRPSEYGSGKPPQREQ-AEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKET 234
           VE+PAK DR SEYGSGKP ++++ AE INL GHNVGAVME+++SS GHRLGGET +    
Sbjct: 147 VEKPAKFDRASEYGSGKPHEKQKAAEAINLAGHNVGAVMEIDKSSVGHRLGGETVRG--- 206

Query: 235 EGGGGCGVRHGNDQKKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAFSDA 294
                  VR GN++KK + KKELPMTAF NSNFQSVNNSVLY SSCNHRDPGLHL F+DA
Sbjct: 207 ------SVRDGNEEKKKKEKKELPMTAFTNSNFQSVNNSVLYGSSCNHRDPGLHLKFADA 266

Query: 295 ADGDGATVDGRKNYK 306
           ADGDGA  DGRKNYK
Sbjct: 267 ADGDGAAADGRKNYK 272

BLAST of Tan0005219 vs. TAIR 10
Match: AT2G46630.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 110095 Blast hits to 59224 proteins in 2216 species: Archae - 177; Bacteria - 15429; Metazoa - 38345; Fungi - 18843; Plants - 13341; Viruses - 3084; Other Eukaryotes - 20876 (source: NCBI BLink). )

HSP 1 Score: 65.5 bits (158), Expect = 8.7e-11
Identity = 99/349 (28.37%), Positives = 139/349 (39.83%), Query Frame = 0

Query: 13  RPSVVAPPVLAAAQPAAEPKPEFLPIAPATQTLQPLDPNLAVPAPASPLIVSPRQ---LP 72
           RP    P      QP + P+ +  P +P  Q  QPL P      P SP    P++     
Sbjct: 40  RPPAKQPSPPRQRQPRSPPRQQD-PPSPPRQQQQPLTPPRQKAPPTSP----PQERSPYH 99

Query: 73  SPVKKAMSPFASPRYGGPATRVPSPPPARVTRSPPISPAKKYPERRIGQTSPPHSPAKSR 132
           SP  + MSP   P+        P PPP R + + P SP +        + + P SPA S 
Sbjct: 100 SPPSRHMSPPTPPK-----AATPPPPPPRSSYTSPPSPKEVQEALPPRKPNSPPSPAHSS 159

Query: 133 RTT-------------------PPP---SPLALPRT------QVTAVNGTTTQPRIQP-E 192
           R+T                   P P   SP +LP +      + T  N  T +   Q  E
Sbjct: 160 RSTTSESVKTRSPSESENHRKAPSPRVLSPYSLPASLLHSERETTQKNILTAEKTSQTHE 219

Query: 193 VEQKNIVYNKTVERPAKSDRPSEYGSG------------KPPQREQAE------VINLTG 252
               N  +N    +    ++   Y               + P    +E      VI + G
Sbjct: 220 TNHHNQNHNHDYNQNHNYNQNHSYNQNQNHQGNNPKKMHRQPSSSDSENIMSTRVITIAG 279

Query: 253 HNVGAVMEVNQSSPGHRLGGE-TTKKKETEGGGGCGVR----------HGNDQKKT---- 295
            N GAVME+ +S  G++ GG  T   + + G G  G R           G  +KKT    
Sbjct: 280 ENKGAVMEILRSPQGNKTGGSGTHSSRVSHGTGEKGRRLQSSSSSSSDEGEGKKKTTKNV 339

BLAST of Tan0005219 vs. TAIR 10
Match: AT1G75260.1 (oxidoreductases, acting on NADH or NADPH )

HSP 1 Score: 48.9 bits (115), Expect = 8.4e-06
Identity = 34/107 (31.78%), Positives = 50/107 (46.73%), Query Frame = 0

Query: 193 EQAEVINLTGHNVGAVMEVNQSSPGHRLGGETTKKKETEGGGGCGVRHGNDQ-------- 252
           +   V  LTG N GA M +          G    KK+ E     G R   D+        
Sbjct: 341 KSVSVYTLTGENKGATMGI----------GSEKDKKDGEVHIRRGYRSNPDESSNTTATE 400

Query: 253 ----KKTEAKKELPMTAFMNSNFQSVNNSVLYDSSCNHRDPGLHLAF 288
               K  EA++E   TA++N N Q +NNS++ +SS +  DPG+H++F
Sbjct: 401 TENPKDDEAEEEASFTAYINGNTQGINNSIVVESSVSENDPGVHMSF 437

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038879417.1  1.5e-97  66.34         proline-rich receptor-like protein kinase PERK8 [Benincasa hispida]
KAG6597060.1    4.2e-92  63.87         hypothetical protein SDJN03_10240, partial [Cucurbita argyrosperma subsp. sorori...
XP_022951357.1  8.0e-91  63.49         sulfated surface glycoprotein 185-like [Cucurbita moschata]
XP_022974816.1  4.4e-89  63.14         wiskott-Aldrich syndrome protein family member 2-like [Cucurbita maxima] >XP_022...
KAG6597059.1    9.7e-89  61.54         hypothetical protein SDJN03_10239, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1GHE5      3.8e-91  63.49         sulfated surface glycoprotein 185-like OS=Cucurbita moschata OX=3662 GN=LOC11145...
A0A6J1ICG8      2.1e-89  63.14         wiskott-Aldrich syndrome protein family member 2-like OS=Cucurbita maxima OX=366...
A0A5A7TZ24      1.5e-87  64.84         Zyxin-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00760 PE=...
A0A1S3AV38      1.5e-87  64.84         zyxin-like OS=Cucumis melo OX=3656 GN=LOC103483158 PE=4 SV=1
A0A6J1GDX3      4.1e-85  70.20         wiskott-Aldrich syndrome protein family member 2-like OS=Cucurbita moschata OX=3...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT2G46630.1     8.7e-11  28.37         unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXP...
AT1G75260.1     8.4e-06  31.78         oxidoreductases, acting on NADH or NADPH

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25

IPR Term  IPR Description   Source       Source Term    Source Description    Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 211..247
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 142..160
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 287..306
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 88..102
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 49..193
None      No IPR available  PANTHER      PTHR33472      OS01G0106600 PROTEIN  coord: 193..291
None      No IPR available  PANTHER      PTHR33472:SF1  EXTENSIN-RELATED      coord: 193..291

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0005219.1  Tan0005219.1  mRNA