Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS
GGAGAGCAGAAATGGATTTGTGGATTGTTGCAGCAGCTACTGGTGCTGGATATGTAGCTAAGTACTGGAAGAATCAATCAAAGGATGGGGACTATTCTTCATCACAGTTATCCTTTGGGGAATCCATTTTGGTGAGTCCTCAATATTCAAATCATTTGTTCAACAAATTTTCACAAAGGAAGAAACCACATGAAGATGTTTTTGGACATGGAAGAATGGAAGAAACATCCTCAGTTGCAGATTTGGGACTTCTTGGGAGTCACCAAGATTCCAATGAACTGTCTACACCAAATATGACTTTGAAATCCTGGATCAATGAGAACTCTAAAGGATACAATGAAGAAAGCAGCAAAAGCAATACTATGGCTAATGACATTGGAACCTTAGTTTGCAGTTCTTCGGGAAGAACAGGCTCTTCAAGAAACAGAAGCACAGCTAAAGCTAAATTCTCACGTGGGGTTCTTATTAAACCTCTAAGTTTTGTAGAAGATTGTCTTCTATTCCATGAGCGTTCTTTAAGTCCATGTATTGCTGTGGAAATGGAGGAAAATAGGCTGTCCAAGGGATACCATGTAGATGCTACTGAAAGTGTTCGTGGGGTTTCTCAACTGCCATTTGGGTCTTTAAGGATCTCTGATGTAGTTAGTAATAAGACAGCAAAGGAATGGGAAAGAAAGTCAAGAAGTTCCAGCAAAATGGCTAACATGGAACACTTTGCTTCTAAAGGTTTAGTAGTCATTTTTATTCTGTTCTTTTGTTTTTTTGCTTATCTCATTAATTTATGTTCTTTGTTTACTACACTCCTTAGCCACTATCATATGAAAACCTTAGAAATTTGAAGTTTTTGTTGAATTTCTAATATCAAATTTTAGAATTTTAAAATCAGTTTGACATTAATTTAAAGAGAATAATTAGATTTGAATTATCTTTTGATCGAAGACCAAAAGATAAGGACTAGTTCCTTACTCCAAGTTCGAACACTCCACAAGTCCTAAATGTTCTTAGCATGCAACCTTGGCTCAAGAACTAGAAAGTGGTGATCAAGCTCTCTCATCTCCCAAAGCATTTCATTCATACAATGTACCTCTTCAAACCCTTAAAACCTTTCCCGCGTAGCAAAAAAGACTTAAAAGTCCATTTTACTCATAACTCAAAAATATATAATAAAACTAAGACATATATGGGTCCACAATTAAAATAACACCAACAAACTTTAGTAGCATCATATGAAACCAATTATAGAGTTTCTGCCCAGTTTTTGTTGTTTGAATTCCTCAATTGAGGTTTTTTCTTTTTTCTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTTTGATTGCTATTTTTAATTTAATTTTATTTTTATGAAATTTACAAAAAGAGGCCAAAGCTCAAGCTAAAGGTGTGGTTTCCAATTGGACAAAAGAGAAGAAAGATCGTATGAACTAAAGAGAGATGTAGATTTGCACCAACTCATAGCATGAAAGATAATGAATTCAAAATAGCAATCAAAGGAGCTTCCTCTATCATTGAATAGTCGGTTGTTTCTCTCCTTTCATGTAATTCAGAAGAAAAATA
AATAAATAAAACCAGAATCAAATGCATCCAAAACGATCTTCTTTTGATTGGTAAAGGGATCTCCGGTGAGAGTATATGAGAGGAAGCTGACATGTCCTTAGGTAAGGCAGTGTACTCTCAACAGAGTTCATCTCTATTTTATGTAGGAATTCATAGTTAACTCTAGCTTCTTAGCCTTTGAACGTGTGATTGGTTGTGGAGGTAACACTAAGACATCACGCTGCCCCTTTGGAGTATAGTATACTTCTAATCATTCTTTTATCATTTGGTGTGTAGCGTTGGGCAAAGGAAATTATCAACTGAGGAAGAGATAAATCTATTGCAATATGAAAGTTTCATGAAAGTTGAGAAGTTCTTGAGAACCCTTCTTCTTTCTACATACTGATGATTGATCCATACCTCAGATTACTAAATTGTGCTTATTGCTTATACTTTTCAGGTGGCCCGGATAAATCGCTTGTATTATATCTTGGATTCTTCATTGGTGTAATCTATTCTTTCATATCCAATAACAGGGAAGTTCACGAGTTGAAAGAATTGTTAAAGCAGACTGAGTATTTGGTTCAAGATCTTCAAGAGGAACTTGAAATGAAGGACTCATTGAAGCTGAAGGAACTCAGTAATGATAATTGCGAATCATACGCTTATAGTAATGCCTTCTCTGAGAAAACAGGTGATGAATCTTCTCCAAAACATCTCATGGATTACACATTAGACTTCAATGCTGAAGAACTATATGAACATAAAGCAGAGGAGAGTTCTGAATCGATGAGCAGAATCGAAGCCGAACTGGAAGCGGAACTCGAGAGGTTAGGATTAAATATAAACACTGGTGGCACGGTGAGATTTCCTGACAACGAAGAGGAGGTAATGATGTACATTTGTTAAGTAGAGCCATGTTAGAAACATAGATTAATTCATACTAGGAATTAAACAAAGTGTAATGGTCTACTCATTGTTTATTATGGATGGTTTAAGTCAAAAAGCAAATCATGATGTGCAGCTTGACCCAGAATTTGAAGAAGATTATGCTGAAGGTGAGTTGAGGAATGAGATGATTAGTGAACAAAGTTCTGGTTGGACAAAGCATAATGAAGAGGCAAGCAACTCGACAGTTCAGTCTGGAAATTATACAGTGCCACCAAGGGAATTGAGCTTGCGTTTACATGATGTTATTCAGTCAAGACTTGAAGCACGAATCAAAGAGCTTGAGAATGCCCTTCAAAATAACCACAAGAAGCTGCAGCAAATTGATACACAATATAGGAGCTGTTCTTGGTTTAAATTAGCAGATGGTGAATTGGAGTTTAATTCTAATACAGAAGGTGAAAACAAAATTGCTAGCCTCGGAACA
mRNA sequence
GGAGAGCAGAAATGGATTTGTGGATTGTTGCAGCAGCTACTGGTGCTGGATATGTAGCTAAGTACTGGAAGAATCAATCAAAGGATGGGGACTATTCTTCATCACAGTTATCCTTTGGGGAATCCATTTTGGTGAGTCCTCAATATTCAAATCATTTGTTCAACAAATTTTCACAAAGGAAGAAACCACATGAAGATGTTTTTGGACATGGAAGAATGGAAGAAACATCCTCAGTTGCAGATTTGGGACTTCTTGGGAGTCACCAAGATTCCAATGAACTGTCTACACCAAATATGACTTTGAAATCCTGGATCAATGAGAACTCTAAAGGATACAATGAAGAAAGCAGCAAAAGCAATACTATGGCTAATGACATTGGAACCTTAGTTTGCAGTTCTTCGGGAAGAACAGGCTCTTCAAGAAACAGAAGCACAGCTAAAGCTAAATTCTCACGTGGGGTTCTTATTAAACCTCTAAGTTTTGTAGAAGATTGTCTTCTATTCCATGAGCGTTCTTTAAGTCCATGTATTGCTGTGGAAATGGAGGAAAATAGGCTGTCCAAGGGATACCATGTAGATGCTACTGAAAGTGTTCGTGGGGTTTCTCAACTGCCATTTGGGTCTTTAAGGATCTCTGATGTAGTTAGTAATAAGACAGCAAAGGAATGGGAAAGAAAGTCAAGAAGTTCCAGCAAAATGGCTAACATGGAACACTTTGCTTCTAAAGGTGGCCCGGATAAATCGCTTGTATTATATCTTGGATTCTTCATTGGTGTAATCTATTCTTTCATATCCAATAACAGGGAAGTTCACGAGTTGAAAGAATTGTTAAAGCAGACTGAGTATTTGGTTCAAGATCTTCAAGAGGAACTTGAAATGAAGGACTCATTGAAGCTGAAGGAACTCAGTAATGATAATTGCGAATCATACGCTTATAGTAATGCCTTCTCTGAGAAAACAGGTGATGAATCTTCTCCAAAACATCTCATGGATTACACATTAGACTTCAATGCTGAAGAACTATATGAACATAAAGCAGAGGAGAGTTCTGAATCGATGAGCAGAATCGAAGCCGAACTGGAAGCGGAACTCGAGAGGTTAGGATTAAATATAAACACTGGTGGCACGGTGAGATTTCCTGACAACGAAGAGGAGGTAATGATTCAAAAAGCAAATCATGATGTGCAGCTTGACCCAGAATTTGAAGAAGATTATGCTGAAGGTGAGTTGAGGAATGAGATGATTAGTGAACAAAGTTCTGGTTGGACAAAGCATAATGAAGAGGCAAGCAACTCGACAGTTCAGTCTGGAAATTATACAGTGCCACCAAGGGAATTGAGCTTGCGTTTACATGATGTTATTCAGTCAAGACTTGAAGCACGAATCAAAGAGCTTGAGAATGCCCTTCAAAATAACCACAAGAAGCTGCAGCAAATTGATACACAATATAGGAGCTGTTCTTGGTTTAAATTAGCAGATGGTGAATTGGAGTTTAATTCTAATACAGAAGGTGAAAACAAAATTGCTAGCCTCGGAACA
Coding sequence (CDS)
ATGGATTTGTGGATTGTTGCAGCAGCTACTGGTGCTGGATATGTAGCTAAGTACTGGAAGAATCAATCAAAGGATGGGGACTATTCTTCATCACAGTTATCCTTTGGGGAATCCATTTTGGTGAGTCCTCAATATTCAAATCATTTGTTCAACAAATTTTCACAAAGGAAGAAACCACATGAAGATGTTTTTGGACATGGAAGAATGGAAGAAACATCCTCAGTTGCAGATTTGGGACTTCTTGGGAGTCACCAAGATTCCAATGAACTGTCTACACCAAATATGACTTTGAAATCCTGGATCAATGAGAACTCTAAAGGATACAATGAAGAAAGCAGCAAAAGCAATACTATGGCTAATGACATTGGAACCTTAGTTTGCAGTTCTTCGGGAAGAACAGGCTCTTCAAGAAACAGAAGCACAGCTAAAGCTAAATTCTCACGTGGGGTTCTTATTAAACCTCTAAGTTTTGTAGAAGATTGTCTTCTATTCCATGAGCGTTCTTTAAGTCCATGTATTGCTGTGGAAATGGAGGAAAATAGGCTGTCCAAGGGATACCATGTAGATGCTACTGAAAGTGTTCGTGGGGTTTCTCAACTGCCATTTGGGTCTTTAAGGATCTCTGATGTAGTTAGTAATAAGACAGCAAAGGAATGGGAAAGAAAGTCAAGAAGTTCCAGCAAAATGGCTAACATGGAACACTTTGCTTCTAAAGGTGGCCCGGATAAATCGCTTGTATTATATCTTGGATTCTTCATTGGTGTAATCTATTCTTTCATATCCAATAACAGGGAAGTTCACGAGTTGAAAGAATTGTTAAAGCAGACTGAGTATTTGGTTCAAGATCTTCAAGAGGAACTTGAAATGAAGGACTCATTGAAGCTGAAGGAACTCAGTAATGATAATTGCGAATCATACGCTTATAGTAATGCCTTCTCTGAGAAAACAGGTGATGAATCTTCTCCAAAACATCTCATGGATTACACATTAGACTTCAATGCTGAAGAACTATATGAACATAAAGCAGAGGAGAGTTCTGAATCGATGAGCAGAATCGAAGCCGAACTGGAAGCGGAACTCGAGAGGTTAGGATTAAATATAAACACTGGTGGCACGGTGAGATTTCCTGACAACGAAGAGGAGGTAATGATTCAAAAAGCAAATCATGATGTGCAGCTTGACCCAGAATTTGAAGAAGATTATGCTGAAGGTGAGTTGAGGAATGAGATGATTAGTGAACAAAGTTCTGGTTGGACAAAGCATAATGAAGAGGCAAGCAACTCGACAGTTCAGTCTGGAAATTATACAGTGCCACCAAGGGAATTGAGCTTGCGTTTACATGATGTTATTCAGTCAAGACTTGAAGCACGAATCAAAGAGCTTGAGAATGCCCTTCAAAATAACCACAAGAAGCTGCAGCAAATTGATACACAATATAGGAGCTGTTCTTGGTTTAAATTAGCAGATGGTGAATTGGAGTTTAATTCTAATACAGAAGGTGAAAACAAAATTGCTAGCCTCGGAACA
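Comparing the mRNA and CDS listings above, the CDS appears to start a few bases into the mRNA, which would correspond to the five_prime_UTR named in the legend. The following is a minimal Python sketch for locating that offset. It uses only the leading bases of each sequence (truncated here for brevity), and the function name `five_prime_utr` is illustrative rather than anything provided by the source page.

```python
def five_prime_utr(mrna: str, cds: str, probe_len: int = 30) -> str:
    """Return the part of the mRNA that precedes the start of the CDS.

    The first `probe_len` bases of the CDS are used as a search probe,
    so the function also works when only sequence prefixes are supplied.
    """
    probe = cds[:probe_len].upper()
    start = mrna.upper().find(probe)
    if start < 0:
        raise ValueError("CDS start not found in mRNA")
    return mrna[:start]


# Leading bases copied from the mRNA and CDS listings (truncated).
mrna_prefix = "GGAGAGCAGAAATGGATTTGTGGATTGTTGCAGCAGCTACTGGTGCTGG"
cds_prefix = "ATGGATTTGTGGATTGTTGCAGCAGCTACTGGTGCTGG"

utr = five_prime_utr(mrna_prefix, cds_prefix)
print(utr, len(utr))  # GGAGAGCAGAA 11
```

With the prefixes shown, the sketch reports an 11-base leader before the ATG start codon.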
Protein sequence
MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPHEDVFGHGRMEETSSVADLGLLGSHQDSNELSTPNMTLKSWINENSKGYNEESSKSNTMANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCLLFHERSLSPCIAVEMEENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRSSSKMANMEHFASKGGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEYLVQDLQEELEMKDSLKLKELSNDNCESYAYSNAFSEKTGDESSPKHLMDYTLDFNAEELYEHKAEESSESMSRIEAELEAELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFEEDYAEGELRNEMISEQSSGWTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQQIDTQYRSCSWFKLADGELEFNSNTEGENKIASLGT
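The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (the page does not state which table was used) and builds the codon table from the conventional TCAG ordering. The name `translate` is illustrative, and only the first ten codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters outside ACGT (for example N) become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGATTTGTGGATTGTTGCAGCAGCTACT"))  # MDLWIVAAAT
```

The output matches the first ten residues of the protein sequence; running the same function on the full CDS is the natural way to check the rest.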
Homology
BLAST of CaUC04G077740 vs. NCBI nr
Match:
XP_038895324.1 (uncharacterized protein LOC120083576 [Benincasa hispida])
HSP 1 Score: 770.4 bits (1988), Expect = 9.8e-219
Identity = 418/508 (82.28%), Positives = 445/508 (87.60%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLWIVAAATGAGY AKYWKNQSKDGDYSSSQL FGES LVSPQYSNH NKFSQ+KK +
Sbjct: 1 MDLWIVAAATGAGYAAKYWKNQSKDGDYSSSQLYFGESNLVSPQYSNHFVNKFSQKKKLY 60
Query: 61 EDVFGHGRMEETSSVADLGL--LGSHQDSNELSTPNMTLKSWINENSKGYNEESSKSNTM 120
EDVFGHGRMEETSSVAD GL GSHQDSNE STPNMTLKSWINENSK + ESSKS
Sbjct: 61 EDVFGHGRMEETSSVADFGLFVFGSHQDSNEQSTPNMTLKSWINENSKEHIGESSKS--- 120
Query: 121 ANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCLLFHERSLSPCIAVEME 180
N+IGTLVCSSS S+RNRSTAKA+FSRGVLIKPLS VED LL +E SL CIAVEME
Sbjct: 121 -NNIGTLVCSSS---VSARNRSTAKARFSRGVLIKPLSSVEDYLLTYESSLDSCIAVEME 180
Query: 181 ENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRSSSKMANMEHFASK 240
EN+LSKG H+DA+ESV GVS LPF SL+I+DVVSNKT KEWERKSRS SKMANMEHFASK
Sbjct: 181 ENKLSKGSHLDASESVYGVSLLPFESLKITDVVSNKTGKEWERKSRSCSKMANMEHFASK 240
Query: 241 GGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEYLVQDLQEELEMKDSLKLKEL 300
G PD+S VLYLGFFIG++YSFISNNREVH+LKELLKQ+EYLVQDLQEELEMKDSLKLKEL
Sbjct: 241 GAPDQSFVLYLGFFIGMMYSFISNNREVHKLKELLKQSEYLVQDLQEELEMKDSLKLKEL 300
Query: 301 SNDNCESYAYSNAFSEKTGDESSPKHLMDYTLDFNAEELYEHKAEESSESMSRIEAELEA 360
SNDNCESY YSNAFSEKTGDESSPKH+MDYTL+FNAEELYEHKA ESSESMSRIEAELEA
Sbjct: 301 SNDNCESYTYSNAFSEKTGDESSPKHVMDYTLNFNAEELYEHKAGESSESMSRIEAELEA 360
Query: 361 ELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFEEDYAEGELRNEMISEQSSGW 420
ELERLGLNI+T GT RF DNEE LDPEFEED+AEGELRNEMISEQSSGW
Sbjct: 361 ELERLGLNISTEGTGRFSDNEE------------LDPEFEEDFAEGELRNEMISEQSSGW 420
Query: 421 TKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQQIDTQ 480
TKHNEE SNSTVQSGNY+V PREL LRLHDVIQSRLEARIKELENAL NNH+KLQQID +
Sbjct: 421 TKHNEEGSNSTVQSGNYSVSPRELRLRLHDVIQSRLEARIKELENALLNNHQKLQQIDAE 480
Query: 481 YRSCSWFKLADGELEFNSNTEGENKIAS 507
YRSCSW +LADGELEF +N E E KIA+
Sbjct: 481 YRSCSWLELADGELEF-TNRENETKIAT 488
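The percentages on the HSP summary line follow directly from the counts: identities (or positives) divided by the alignment length, which in BLAST output includes gap columns. A small Python sketch that parses such a line and recomputes the values is shown here. The function name `parse_summary` and the returned keys are illustrative assumptions, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 418/508 (82.28%), Positives = 445/508 (87.60%)".
SUMMARY_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_summary(line: str) -> dict:
    """Extract the counts from an HSP summary line and recompute percentages."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, length, positive, _ = map(int, match.groups())
    return {
        "aligned_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / length, 2),
    }


line = "Identity = 418/508 (82.28%), Positives = 445/508 (87.60%), Query Frame = 0"
print(parse_summary(line))
```

For the Benincasa hispida hit this reproduces 82.28 and 87.6, the values reported on the page.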
BLAST of CaUC04G077740 vs. NCBI nr
Match:
XP_008449909.1 (PREDICTED: uncharacterized protein LOC103491621 [Cucumis melo] >TYJ98934.1 pericentriolar material 1 protein [Cucumis melo var. makuwa])
HSP 1 Score: 709.9 bits (1831), Expect = 1.6e-200
Identity = 389/501 (77.64%), Positives = 422/501 (84.23%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLWIVAAATGAGYVAKYWKNQSKDGD SQLSFGES LVSPQYSNH NKFS RKKP+
Sbjct: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDNPLSQLSFGESNLVSPQYSNHFLNKFSLRKKPY 60
Query: 61 EDVFGHGRMEETSSVADLGLLGSHQDS--NELSTPNMTLKSWINENSKGYNEESSKSNTM 120
EDVFGHG MEETSSVA+LGL GSHQDS NELST NMTLKSWI+ENSK + ESSKS
Sbjct: 61 EDVFGHGSMEETSSVAELGLFGSHQDSNGNELSTTNMTLKSWISENSKEHIGESSKS--- 120
Query: 121 ANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCLLFHERSLSPCIAVEME 180
N+IGTLVCS SSRNRST KAKFS GVL+KPL+FVEDCLL HE SL+P IAVE+E
Sbjct: 121 -NNIGTLVCS------SSRNRSTTKAKFSDGVLVKPLNFVEDCLLAHESSLNPRIAVELE 180
Query: 181 ENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRSSSKMANMEHFASK 240
EN+L KG H+DA ES+ GVSQLPF SL+ISD+V NKT KEWERKSRS SKMAN EHF+SK
Sbjct: 181 ENKLFKGSHLDANESLCGVSQLPFESLKISDIVINKTGKEWERKSRSFSKMANREHFSSK 240
Query: 241 GGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEYLVQDLQEELEMKDSLKLKEL 300
G D+S +LY G FIGVI+SF+SN REVH LKELLKQTEYLV DLQEELEMKDSLKLKEL
Sbjct: 241 GVADESFLLYFGVFIGVIFSFMSNKREVHNLKELLKQTEYLVHDLQEELEMKDSLKLKEL 300
Query: 301 SNDNCESYAYS-NAFSEKTGDESSPKHLMDYTLDFNAEELYEHKAEESSESMSRIEAELE 360
SNDNCESY YS N FSEKTGD SSP+H+MDYT++FNAEELYEHKAEESSESMSRIEAELE
Sbjct: 301 SNDNCESYTYSNNVFSEKTGDGSSPQHVMDYTINFNAEELYEHKAEESSESMSRIEAELE 360
Query: 361 AELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFEEDYAEGELRNEMISEQSSG 420
AELERLGLN++ T R+ + EEE LDPEFEED+AEGELRNEMI E+S G
Sbjct: 361 AELERLGLNVSIDCTARYYEEEEE-----------LDPEFEEDFAEGELRNEMIIEESCG 420
Query: 421 WTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQQIDT 480
WTK NEE SNSTV SGNYTV PRELSLRLHDVIQSRLEARIKELENALQNNHKKLQ+IDT
Sbjct: 421 WTKPNEEESNSTVHSGNYTVSPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQKIDT 479
Query: 481 QYRSCSWFKLADGELEFNSNT 499
QYRS SW ++ D +LEF SNT
Sbjct: 481 QYRS-SWLEVDDDDLEFTSNT 479
BLAST of CaUC04G077740 vs. NCBI nr
Match:
XP_011650179.2 (uncharacterized protein LOC105434747 [Cucumis sativus] >KGN64051.1 hypothetical protein Csa_014333 [Cucumis sativus])
HSP 1 Score: 697.6 bits (1799), Expect = 8.0e-197
Identity = 386/501 (77.05%), Positives = 418/501 (83.43%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLW+VAAATGAGYVAKYWKNQSKDGD S SQLSFGES LVSP+YSN +KFS RKKP+
Sbjct: 1 MDLWVVAAATGAGYVAKYWKNQSKDGDNSLSQLSFGESNLVSPEYSNLFLDKFSLRKKPY 60
Query: 61 EDVFGHGRMEETSSVADLGLLGSHQDS--NELSTPNMTLKSWINENSKGYNEESSKSNTM 120
EDVFGHG MEET SV++LGL+GSHQ S NEL T NMTLKSWINENSKG+ ESSKS
Sbjct: 61 EDVFGHGIMEETPSVSELGLIGSHQGSNGNELPTTNMTLKSWINENSKGHIGESSKS--- 120
Query: 121 ANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCLLFHERSLSPCIAVEME 180
N+IGTLVCS SSRNRST AKFS GVL+KPL+ VEDCLL HE SL+PCIAVE+E
Sbjct: 121 -NNIGTLVCS------SSRNRSTGNAKFSNGVLVKPLNLVEDCLLAHESSLNPCIAVELE 180
Query: 181 ENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRSSSKMANMEHFASK 240
EN+L KG H+DA ES+ GVSQLPF SL+ISD+VSNKT KEWERKSRS SKM N EH ASK
Sbjct: 181 ENKLFKGSHLDANESLCGVSQLPFESLKISDIVSNKTGKEWERKSRSFSKMDNREHSASK 240
Query: 241 GGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEYLVQDLQEELEMKDSLKLKEL 300
G D+S VLYLG FIGVI+SF+SN REVH LKELLKQTE LVQDLQEELEMKDSLKLKEL
Sbjct: 241 GVADESFVLYLGVFIGVIFSFMSNKREVHNLKELLKQTEDLVQDLQEELEMKDSLKLKEL 300
Query: 301 SNDNCESYAYS-NAFSEKTGDESSPKHLMDYTLDFNAEELYEHKAEESSESMSRIEAELE 360
SNDNCESY YS NAFSEKT D SS +H+MDYT++FNAEELYEHKAEESSESMSRIEAELE
Sbjct: 301 SNDNCESYTYSNNAFSEKTADGSSTQHVMDYTINFNAEELYEHKAEESSESMSRIEAELE 360
Query: 361 AELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFEEDYAEGELRNEMISEQSSG 420
AELERLGLN++ T RF + EEE LDPEFEED+AEGELRNEMI E+S G
Sbjct: 361 AELERLGLNVSIDCTARFHEEEEE-----------LDPEFEEDFAEGELRNEMIIEESCG 420
Query: 421 WTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQQIDT 480
WTK NEE SNSTV SGNYTV PRELSLRLHDVIQSRLEARIKELENALQNN KKLQQID
Sbjct: 421 WTKPNEEESNSTVHSGNYTVSPRELSLRLHDVIQSRLEARIKELENALQNNSKKLQQIDA 479
Query: 481 QYRSCSWFKLADGELEFNSNT 499
QYRS SW ++AD ELEF SNT
Sbjct: 481 QYRS-SWLEVADDELEFISNT 479
BLAST of CaUC04G077740 vs. NCBI nr
Match:
XP_022923688.1 (uncharacterized protein LOC111431325 isoform X1 [Cucurbita moschata] >XP_022923689.1 uncharacterized protein LOC111431325 isoform X1 [Cucurbita moschata])
HSP 1 Score: 639.0 bits (1647), Expect = 3.4e-179
Identity = 361/509 (70.92%), Positives = 405/509 (79.57%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLWIVAAATGAGYVAKYWKNQSKDGD S SQLSFGES LVSPQYSNHL +KFSQRKK +
Sbjct: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDCSLSQLSFGESNLVSPQYSNHLLSKFSQRKKRY 60
Query: 61 EDVFGHGRMEETSS---------------------VADLGLLGSHQDSNELSTPNMTLKS 120
EDVFGHG ME TSS + LGL GSHQD N STPNMTL S
Sbjct: 61 EDVFGHGGMEGTSSDQNPSDIASVAEMACISGGFEIGKLGLFGSHQDFNVFSTPNMTLTS 120
Query: 121 WINENSKGYNEESSKSNTMANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVE 180
E G ESSKSNTMANDIG LVC+SSGRT SSRNRSTAK ++ G LIKPL+ E
Sbjct: 121 ---EERVG---ESSKSNTMANDIGILVCNSSGRTASSRNRSTAKGRYLHGDLIKPLTCEE 180
Query: 181 DCLLFHERSLSPCIAVEMEENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSN-KTAKE 240
DCL H+ SLSPCIAVE+EE++L KG H + ESV VSQLPFG L SD VSN KT KE
Sbjct: 181 DCLPAHQSSLSPCIAVELEEHKLPKGSHFNGDESVCAVSQLPFGPL--SDTVSNKKTGKE 240
Query: 241 WERKSRSSSKMANMEHFASKGGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEY 300
WERKSRSS+KMAN EHF SKGGPD+SLVLYLGFFIG+IYSFISN REVH+LKELLKQTE
Sbjct: 241 WERKSRSSNKMANREHFDSKGGPDESLVLYLGFFIGIIYSFISNKREVHKLKELLKQTES 300
Query: 301 LVQDLQEELEMKDSLKLKELSNDNCESYAYSNAFSEKTGDESSPKHLMDYTLDFNAEELY 360
LV+DLQEELEMKDSL LKELSND YSN FSEKT DESSPKH+MDYT++FNAEELY
Sbjct: 301 LVEDLQEELEMKDSLTLKELSND-----TYSNGFSEKTVDESSPKHVMDYTVNFNAEELY 360
Query: 361 EHKAEESSESMSRIEAELEAELERLGLNINTGGTVRFP-DNEEEVMIQKANHDVQLDPEF 420
+H AE+SSES+ RIEAELEAELERLGLNI+T GT R+P D++++ ++ H VQL PEF
Sbjct: 361 KHMAEQSSESIHRIEAELEAELERLGLNISTEGTERYPDDDDDDDENHRSIHSVQLGPEF 420
Query: 421 EEDYAEGELRNEMISEQSSGWTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEAR 480
EED+AEGELRN+M+S +S GW+K N+EA+ STV SGNYTV PRELSLRLHDVIQSRLEAR
Sbjct: 421 EEDFAEGELRNDMMSGESCGWSKANKEANESTVHSGNYTVSPRELSLRLHDVIQSRLEAR 480
Query: 481 IKELENALQNNHKKLQQIDTQYRSCSWFK 487
I+ELENALQNN+KK Q ++T+Y S SW +
Sbjct: 481 IEELENALQNNYKKQQLLNTEYTS-SWLE 495
BLAST of CaUC04G077740 vs. NCBI nr
Match:
KAG6584411.1 (hypothetical protein SDJN03_20343, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 636.3 bits (1640), Expect = 2.2e-178
Identity = 359/508 (70.67%), Positives = 403/508 (79.33%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLWIVAAATGAGYVAKYWKNQSKDGD S SQLSFGES LVSP+YSNHL +KFSQRKK +
Sbjct: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDCSLSQLSFGESNLVSPKYSNHLLSKFSQRKKRY 60
Query: 61 EDVFGHGRMEETSS---------------------VADLGLLGSHQDSNELSTPNMTLKS 120
EDVFGHG ME TSS + LGL GSHQD N STPNMTL S
Sbjct: 61 EDVFGHGGMEGTSSDQNPSDIASVAEMACISGGFEIGKLGLFGSHQDFNVFSTPNMTLTS 120
Query: 121 WINENSKGYNEESSKSNTMANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVE 180
E G ESSKSNTMANDIG LVC+SSGRT SSRNRSTAK ++ G LIKPL++ E
Sbjct: 121 ---EERVG---ESSKSNTMANDIGILVCNSSGRTASSRNRSTAKGRYLHGDLIKPLTYEE 180
Query: 181 DCLLFHERSLSPCIAVEMEENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSN-KTAKE 240
DCL H+ SLSPCIAVE+EE++L KG H + ESV VSQLPFG L SD VSN KT KE
Sbjct: 181 DCLPAHQSSLSPCIAVELEEHKLPKGSHFNGDESVCAVSQLPFGPL--SDTVSNKKTGKE 240
Query: 241 WERKSRSSSKMANMEHFASKGGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEY 300
WERKSRSS+KMAN EHF SKGGPD+SLVLYLGFFIG+IYSFISN REVHELKELLKQTE
Sbjct: 241 WERKSRSSNKMANREHFDSKGGPDESLVLYLGFFIGIIYSFISNKREVHELKELLKQTES 300
Query: 301 LVQDLQEELEMKDSLKLKELSNDNCESYAYSNAFSEKTGDESSPKHLMDYTLDFNAEELY 360
LV+DLQEELEMKDSL LKELSND YSN FSEKT DESSPKH+MDYT++FNAEELY
Sbjct: 301 LVEDLQEELEMKDSLTLKELSND-----TYSNGFSEKTVDESSPKHVMDYTVNFNAEELY 360
Query: 361 EHKAEESSESMSRIEAELEAELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFE 420
+H AE+SSES+ RIEAELEAELERLGLNI+T GT R+PD++++ D +L PEFE
Sbjct: 361 KHMAEQSSESIHRIEAELEAELERLGLNISTEGTERYPDDDDD--------DDELGPEFE 420
Query: 421 EDYAEGELRNEMISEQSSGWTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARI 480
ED+AEGELRN+M+S +S GW+K N+EA+ STV SGNYTV PRELSLRLHDVIQSRLEARI
Sbjct: 421 EDFAEGELRNDMMSGESCGWSKANKEANESTVHSGNYTVSPRELSLRLHDVIQSRLEARI 480
Query: 481 KELENALQNNHKKLQQIDTQYRSCSWFK 487
+ELENALQNN+KK Q ++T+Y S SW +
Sbjct: 481 EELENALQNNYKKQQLLNTEYTS-SWLE 486
BLAST of CaUC04G077740 vs. ExPASy TrEMBL
Match:
A0A5D3BIP8 (Pericentriolar material 1 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G001350 PE=4 SV=1)
HSP 1 Score: 709.9 bits (1831), Expect = 7.6e-201
Identity = 389/501 (77.64%), Positives = 422/501 (84.23%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLWIVAAATGAGYVAKYWKNQSKDGD SQLSFGES LVSPQYSNH NKFS RKKP+
Sbjct: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDNPLSQLSFGESNLVSPQYSNHFLNKFSLRKKPY 60
Query: 61 EDVFGHGRMEETSSVADLGLLGSHQDS--NELSTPNMTLKSWINENSKGYNEESSKSNTM 120
EDVFGHG MEETSSVA+LGL GSHQDS NELST NMTLKSWI+ENSK + ESSKS
Sbjct: 61 EDVFGHGSMEETSSVAELGLFGSHQDSNGNELSTTNMTLKSWISENSKEHIGESSKS--- 120
Query: 121 ANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCLLFHERSLSPCIAVEME 180
N+IGTLVCS SSRNRST KAKFS GVL+KPL+FVEDCLL HE SL+P IAVE+E
Sbjct: 121 -NNIGTLVCS------SSRNRSTTKAKFSDGVLVKPLNFVEDCLLAHESSLNPRIAVELE 180
Query: 181 ENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRSSSKMANMEHFASK 240
EN+L KG H+DA ES+ GVSQLPF SL+ISD+V NKT KEWERKSRS SKMAN EHF+SK
Sbjct: 181 ENKLFKGSHLDANESLCGVSQLPFESLKISDIVINKTGKEWERKSRSFSKMANREHFSSK 240
Query: 241 GGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEYLVQDLQEELEMKDSLKLKEL 300
G D+S +LY G FIGVI+SF+SN REVH LKELLKQTEYLV DLQEELEMKDSLKLKEL
Sbjct: 241 GVADESFLLYFGVFIGVIFSFMSNKREVHNLKELLKQTEYLVHDLQEELEMKDSLKLKEL 300
Query: 301 SNDNCESYAYS-NAFSEKTGDESSPKHLMDYTLDFNAEELYEHKAEESSESMSRIEAELE 360
SNDNCESY YS N FSEKTGD SSP+H+MDYT++FNAEELYEHKAEESSESMSRIEAELE
Sbjct: 301 SNDNCESYTYSNNVFSEKTGDGSSPQHVMDYTINFNAEELYEHKAEESSESMSRIEAELE 360
Query: 361 AELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFEEDYAEGELRNEMISEQSSG 420
AELERLGLN++ T R+ + EEE LDPEFEED+AEGELRNEMI E+S G
Sbjct: 361 AELERLGLNVSIDCTARYYEEEEE-----------LDPEFEEDFAEGELRNEMIIEESCG 420
Query: 421 WTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQQIDT 480
WTK NEE SNSTV SGNYTV PRELSLRLHDVIQSRLEARIKELENALQNNHKKLQ+IDT
Sbjct: 421 WTKPNEEESNSTVHSGNYTVSPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQKIDT 479
Query: 481 QYRSCSWFKLADGELEFNSNT 499
QYRS SW ++ D +LEF SNT
Sbjct: 481 QYRS-SWLEVDDDDLEFTSNT 479
BLAST of CaUC04G077740 vs. ExPASy TrEMBL
Match:
A0A1S3BN48 (uncharacterized protein LOC103491621 OS=Cucumis melo OX=3656 GN=LOC103491621 PE=4 SV=1)
HSP 1 Score: 709.9 bits (1831), Expect = 7.6e-201
Identity = 389/501 (77.64%), Positives = 422/501 (84.23%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLWIVAAATGAGYVAKYWKNQSKDGD SQLSFGES LVSPQYSNH NKFS RKKP+
Sbjct: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDNPLSQLSFGESNLVSPQYSNHFLNKFSLRKKPY 60
Query: 61 EDVFGHGRMEETSSVADLGLLGSHQDS--NELSTPNMTLKSWINENSKGYNEESSKSNTM 120
EDVFGHG MEETSSVA+LGL GSHQDS NELST NMTLKSWI+ENSK + ESSKS
Sbjct: 61 EDVFGHGSMEETSSVAELGLFGSHQDSNGNELSTTNMTLKSWISENSKEHIGESSKS--- 120
Query: 121 ANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCLLFHERSLSPCIAVEME 180
N+IGTLVCS SSRNRST KAKFS GVL+KPL+FVEDCLL HE SL+P IAVE+E
Sbjct: 121 -NNIGTLVCS------SSRNRSTTKAKFSDGVLVKPLNFVEDCLLAHESSLNPRIAVELE 180
Query: 181 ENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRSSSKMANMEHFASK 240
EN+L KG H+DA ES+ GVSQLPF SL+ISD+V NKT KEWERKSRS SKMAN EHF+SK
Sbjct: 181 ENKLFKGSHLDANESLCGVSQLPFESLKISDIVINKTGKEWERKSRSFSKMANREHFSSK 240
Query: 241 GGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEYLVQDLQEELEMKDSLKLKEL 300
G D+S +LY G FIGVI+SF+SN REVH LKELLKQTEYLV DLQEELEMKDSLKLKEL
Sbjct: 241 GVADESFLLYFGVFIGVIFSFMSNKREVHNLKELLKQTEYLVHDLQEELEMKDSLKLKEL 300
Query: 301 SNDNCESYAYS-NAFSEKTGDESSPKHLMDYTLDFNAEELYEHKAEESSESMSRIEAELE 360
SNDNCESY YS N FSEKTGD SSP+H+MDYT++FNAEELYEHKAEESSESMSRIEAELE
Sbjct: 301 SNDNCESYTYSNNVFSEKTGDGSSPQHVMDYTINFNAEELYEHKAEESSESMSRIEAELE 360
Query: 361 AELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFEEDYAEGELRNEMISEQSSG 420
AELERLGLN++ T R+ + EEE LDPEFEED+AEGELRNEMI E+S G
Sbjct: 361 AELERLGLNVSIDCTARYYEEEEE-----------LDPEFEEDFAEGELRNEMIIEESCG 420
Query: 421 WTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQQIDT 480
WTK NEE SNSTV SGNYTV PRELSLRLHDVIQSRLEARIKELENALQNNHKKLQ+IDT
Sbjct: 421 WTKPNEEESNSTVHSGNYTVSPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQKIDT 479
Query: 481 QYRSCSWFKLADGELEFNSNT 499
QYRS SW ++ D +LEF SNT
Sbjct: 481 QYRS-SWLEVDDDDLEFTSNT 479
BLAST of CaUC04G077740 vs. ExPASy TrEMBL
Match:
A0A0A0LVL9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G039170 PE=4 SV=1)
HSP 1 Score: 697.6 bits (1799), Expect = 3.9e-197
Identity = 386/501 (77.05%), Positives = 418/501 (83.43%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLW+VAAATGAGYVAKYWKNQSKDGD S SQLSFGES LVSP+YSN +KFS RKKP+
Sbjct: 1 MDLWVVAAATGAGYVAKYWKNQSKDGDNSLSQLSFGESNLVSPEYSNLFLDKFSLRKKPY 60
Query: 61 EDVFGHGRMEETSSVADLGLLGSHQDS--NELSTPNMTLKSWINENSKGYNEESSKSNTM 120
EDVFGHG MEET SV++LGL+GSHQ S NEL T NMTLKSWINENSKG+ ESSKS
Sbjct: 61 EDVFGHGIMEETPSVSELGLIGSHQGSNGNELPTTNMTLKSWINENSKGHIGESSKS--- 120
Query: 121 ANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCLLFHERSLSPCIAVEME 180
N+IGTLVCS SSRNRST AKFS GVL+KPL+ VEDCLL HE SL+PCIAVE+E
Sbjct: 121 -NNIGTLVCS------SSRNRSTGNAKFSNGVLVKPLNLVEDCLLAHESSLNPCIAVELE 180
Query: 181 ENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRSSSKMANMEHFASK 240
EN+L KG H+DA ES+ GVSQLPF SL+ISD+VSNKT KEWERKSRS SKM N EH ASK
Sbjct: 181 ENKLFKGSHLDANESLCGVSQLPFESLKISDIVSNKTGKEWERKSRSFSKMDNREHSASK 240
Query: 241 GGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEYLVQDLQEELEMKDSLKLKEL 300
G D+S VLYLG FIGVI+SF+SN REVH LKELLKQTE LVQDLQEELEMKDSLKLKEL
Sbjct: 241 GVADESFVLYLGVFIGVIFSFMSNKREVHNLKELLKQTEDLVQDLQEELEMKDSLKLKEL 300
Query: 301 SNDNCESYAYS-NAFSEKTGDESSPKHLMDYTLDFNAEELYEHKAEESSESMSRIEAELE 360
SNDNCESY YS NAFSEKT D SS +H+MDYT++FNAEELYEHKAEESSESMSRIEAELE
Sbjct: 301 SNDNCESYTYSNNAFSEKTADGSSTQHVMDYTINFNAEELYEHKAEESSESMSRIEAELE 360
Query: 361 AELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFEEDYAEGELRNEMISEQSSG 420
AELERLGLN++ T RF + EEE LDPEFEED+AEGELRNEMI E+S G
Sbjct: 361 AELERLGLNVSIDCTARFHEEEEE-----------LDPEFEEDFAEGELRNEMIIEESCG 420
Query: 421 WTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARIKELENALQNNHKKLQQIDT 480
WTK NEE SNSTV SGNYTV PRELSLRLHDVIQSRLEARIKELENALQNN KKLQQID
Sbjct: 421 WTKPNEEESNSTVHSGNYTVSPRELSLRLHDVIQSRLEARIKELENALQNNSKKLQQIDA 479
Query: 481 QYRSCSWFKLADGELEFNSNT 499
QYRS SW ++AD ELEF SNT
Sbjct: 481 QYRS-SWLEVADDELEFISNT 479
BLAST of CaUC04G077740 vs. ExPASy TrEMBL
Match:
A0A6J1E743 (uncharacterized protein LOC111431325 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111431325 PE=4 SV=1)
HSP 1 Score: 639.0 bits (1647), Expect = 1.6e-179
Identity = 361/509 (70.92%), Positives = 405/509 (79.57%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLWIVAAATGAGYVAKYWKNQSKDGD S SQLSFGES LVSPQYSNHL +KFSQRKK +
Sbjct: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDCSLSQLSFGESNLVSPQYSNHLLSKFSQRKKRY 60
Query: 61 EDVFGHGRMEETSS---------------------VADLGLLGSHQDSNELSTPNMTLKS 120
EDVFGHG ME TSS + LGL GSHQD N STPNMTL S
Sbjct: 61 EDVFGHGGMEGTSSDQNPSDIASVAEMACISGGFEIGKLGLFGSHQDFNVFSTPNMTLTS 120
Query: 121 WINENSKGYNEESSKSNTMANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVE 180
E G ESSKSNTMANDIG LVC+SSGRT SSRNRSTAK ++ G LIKPL+ E
Sbjct: 121 ---EERVG---ESSKSNTMANDIGILVCNSSGRTASSRNRSTAKGRYLHGDLIKPLTCEE 180
Query: 181 DCLLFHERSLSPCIAVEMEENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSN-KTAKE 240
DCL H+ SLSPCIAVE+EE++L KG H + ESV VSQLPFG L SD VSN KT KE
Sbjct: 181 DCLPAHQSSLSPCIAVELEEHKLPKGSHFNGDESVCAVSQLPFGPL--SDTVSNKKTGKE 240
Query: 241 WERKSRSSSKMANMEHFASKGGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEY 300
WERKSRSS+KMAN EHF SKGGPD+SLVLYLGFFIG+IYSFISN REVH+LKELLKQTE
Sbjct: 241 WERKSRSSNKMANREHFDSKGGPDESLVLYLGFFIGIIYSFISNKREVHKLKELLKQTES 300
Query: 301 LVQDLQEELEMKDSLKLKELSNDNCESYAYSNAFSEKTGDESSPKHLMDYTLDFNAEELY 360
LV+DLQEELEMKDSL LKELSND YSN FSEKT DESSPKH+MDYT++FNAEELY
Sbjct: 301 LVEDLQEELEMKDSLTLKELSND-----TYSNGFSEKTVDESSPKHVMDYTVNFNAEELY 360
Query: 361 EHKAEESSESMSRIEAELEAELERLGLNINTGGTVRFP-DNEEEVMIQKANHDVQLDPEF 420
+H AE+SSES+ RIEAELEAELERLGLNI+T GT R+P D++++ ++ H VQL PEF
Sbjct: 361 KHMAEQSSESIHRIEAELEAELERLGLNISTEGTERYPDDDDDDDENHRSIHSVQLGPEF 420
Query: 421 EEDYAEGELRNEMISEQSSGWTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEAR 480
EED+AEGELRN+M+S +S GW+K N+EA+ STV SGNYTV PRELSLRLHDVIQSRLEAR
Sbjct: 421 EEDFAEGELRNDMMSGESCGWSKANKEANESTVHSGNYTVSPRELSLRLHDVIQSRLEAR 480
Query: 481 IKELENALQNNHKKLQQIDTQYRSCSWFK 487
I+ELENALQNN+KK Q ++T+Y S SW +
Sbjct: 481 IEELENALQNNYKKQQLLNTEYTS-SWLE 495
BLAST of CaUC04G077740 vs. ExPASy TrEMBL
Match:
A0A6J1ECL1 (uncharacterized protein LOC111431325 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431325 PE=4 SV=1)
HSP 1 Score: 634.4 bits (1635), Expect = 4.0e-178
Identity = 359/508 (70.67%), Positives = 402/508 (79.13%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLWIVAAATGAGYVAKYWKNQSKDGD S SQLSFGES LVSPQYSNHL +KFSQRKK +
Sbjct: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDCSLSQLSFGESNLVSPQYSNHLLSKFSQRKKRY 60
Query: 61 EDVFGHGRMEETSS---------------------VADLGLLGSHQDSNELSTPNMTLKS 120
EDVFGHG ME TSS + LGL GSHQD N STPNMTL S
Sbjct: 61 EDVFGHGGMEGTSSDQNPSDIASVAEMACISGGFEIGKLGLFGSHQDFNVFSTPNMTLTS 120
Query: 121 WINENSKGYNEESSKSNTMANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVE 180
E G ESSKSNTMANDIG LVC+SSGRT SSRNRSTAK ++ G LIKPL+ E
Sbjct: 121 ---EERVG---ESSKSNTMANDIGILVCNSSGRTASSRNRSTAKGRYLHGDLIKPLTCEE 180
Query: 181 DCLLFHERSLSPCIAVEMEENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSN-KTAKE 240
DCL H+ SLSPCIAVE+EE++L KG H + ESV VSQLPFG L SD VSN KT KE
Sbjct: 181 DCLPAHQSSLSPCIAVELEEHKLPKGSHFNGDESVCAVSQLPFGPL--SDTVSNKKTGKE 240
Query: 241 WERKSRSSSKMANMEHFASKGGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEY 300
WERKSRSS+KMAN EHF SKGGPD+SLVLYLGFFIG+IYSFISN REVH+LKELLKQTE
Sbjct: 241 WERKSRSSNKMANREHFDSKGGPDESLVLYLGFFIGIIYSFISNKREVHKLKELLKQTES 300
Query: 301 LVQDLQEELEMKDSLKLKELSNDNCESYAYSNAFSEKTGDESSPKHLMDYTLDFNAEELY 360
LV+DLQEELEMKDSL LKELSND YSN FSEKT DESSPKH+MDYT++FNAEELY
Sbjct: 301 LVEDLQEELEMKDSLTLKELSND-----TYSNGFSEKTVDESSPKHVMDYTVNFNAEELY 360
Query: 361 EHKAEESSESMSRIEAELEAELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFE 420
+H AE+SSES+ RIEAELEAELERLGLNI+T GT R+PD++++ D +L PEFE
Sbjct: 361 KHMAEQSSESIHRIEAELEAELERLGLNISTEGTERYPDDDDD--------DDELGPEFE 420
Query: 421 EDYAEGELRNEMISEQSSGWTKHNEEASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARI 480
ED+AEGELRN+M+S +S GW+K N+EA+ STV SGNYTV PRELSLRLHDVIQSRLEARI
Sbjct: 421 EDFAEGELRNDMMSGESCGWSKANKEANESTVHSGNYTVSPRELSLRLHDVIQSRLEARI 480
Query: 481 KELENALQNNHKKLQQIDTQYRSCSWFK 487
+ELENALQNN+KK Q ++T+Y S SW +
Sbjct: 481 EELENALQNNYKKQQLLNTEYTS-SWLE 486
BLAST of CaUC04G077740 vs. TAIR 10
Match:
AT5G61040.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G08010.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 165.6 bits (418), Expect = 1.0e-40
Identity = 157/516 (30.43%), Positives = 246/516 (47.67%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MD+W++AA GY+AK +N +K D + L + L ++ + KK +
Sbjct: 1 MDVWLIAATAATGYIAKQLQNVTKGKD---NVLESSSEDVKPESPPGCLLSRLVRVKKAN 60
Query: 61 EDVFGHGRMEETSSVADLGLLG---SHQDSNELST--------PNMTLKSWINENSKGYN 120
E+ FG +M D G + ++N T P M L +W
Sbjct: 61 ENKFGDEKMLSDGDNPDASTSGESSGYYETNHSDTLFGLMPEFPEMELGTW--------- 120
Query: 121 EESSKSNTMANDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCLL--FH-- 180
T N +G +SS R RN+ +F R LIKPLS ++ CL+ FH
Sbjct: 121 ------KTSGNLVGDTQLNSSFR----RNQ-----RFRR--LIKPLSSMDSCLMSRFHRE 180
Query: 181 ERSLSPCIAVEMEENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRS 240
+ ++ + S + T+ R +S+ SL +S + E K+
Sbjct: 181 QMTIEDYMTSPFPSPHASVSRPLLVTDGTRVISKSAADSLWLSQHIVLS-----EDKATL 240
Query: 241 SSKMANMEHFASKGGPDKS-----------LVLYLGFFIGVIYSFISNNREVHELKELLK 300
S + +E + G +KS ++L +G IG++ SF+++ EV ++K+ LK
Sbjct: 241 SCGVPGVESSIERVGNEKSKSRKHGLGDATMLLQIGISIGIMSSFMASQAEVSKVKQELK 300
Query: 301 QTEYLVQDLQEELEMKDSLKLKELSNDNCESYAYSNAFSEKTGDESSPKHLMDYTLDFNA 360
QTE LV DL++ELEMKD+L +KE+ +
Sbjct: 301 QTENLVHDLEDELEMKDTLIVKEIDIE--------------------------------- 360
Query: 361 EELYEHKAEESSESMSRIEAELEAELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLD 420
KA ESSES+S IEAELEAELERL +N+N+ + + + ++++
Sbjct: 361 ------KAAESSESISNIEAELEAELERLEINMNSSN-----------IETRLSDIIEME 420
Query: 421 PEFEEDYAEGELRNEMISEQSSGWTKHNEEAS-NSTVQSGNYTVPPRELSLRLHDVIQSR 480
P+ E ++A+GELR + + + T+ N++ S NST +SGNY V PRELSLRLH VI SR
Sbjct: 421 PDCEVEFAQGELRADRVKGKRLDETESNQDPSGNSTPESGNYAVSPRELSLRLHKVINSR 432
Query: 481 LEARIKELENALQNNHKKLQQI--DTQYRSCSWFKL 488
LE RI ELE ALQ + +K++Q+ +++ + SW +L
Sbjct: 481 LEKRIGELETALQESQRKVEQLVMESESKKKSWSRL 432
BLAST of CaUC04G077740 vs. TAIR 10
Match:
AT5G08010.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G61040.1); Has 5732 Blast hits to 4319 proteins in 440 species: Archae - 66; Bacteria - 397; Metazoa - 2437; Fungi - 292; Plants - 238; Viruses - 35; Other Eukaryotes - 2267 (source: NCBI BLink). )
HSP 1 Score: 130.2 bits (326), Expect = 4.8e-30
Identity = 143/489 (29.24%), Positives = 221/489 (45.19%), Query Frame = 0
Query: 1 MDLWIVAAATGAGYVAKYWKNQSKDGDYSSSQLSFGESILVSPQYSNHLFNKFSQRKKPH 60
MDLW++AA GY+ K+ +N SK G SS L+ L SP+ L + + KKP
Sbjct: 1 MDLWLIAATAATGYITKHLRNVSK-GKSSSEDLT--NVKLESPRC---LASNVVRVKKPK 60
Query: 61 EDVFGHGRMEETSSVADLG-LLGSHQDSNELSTPNMTLKSWINENSKGYNEESSKSNTMA 120
E+ F ET + + G G SN NE GY++E
Sbjct: 61 EENFEDCLNGETLDLYECGNAYGVEVASN-------------NEEDLGYDDEI------- 120
Query: 121 NDIGTLVCSSSGRTGSSRNRSTAKAKFSRGVLIKPLSFVEDCL-LFHERSL--------- 180
R+GS NR+ + IKP S + + H +
Sbjct: 121 ------------RSGSFGNRAFLR---RNQCPIKPFSLEKSIMSRLHREKISMEEYMRSP 180
Query: 181 --SPCIAVEMEENRLSKGYHVDATESVRGVSQLPFGSLRISDVVSNKTAKEWERKSRSSS 240
SPC +V ++ G +V + + VSQ I + K++ + ++ +
Sbjct: 181 FPSPCGSVS-RPLLVTDGTNVISKNTGDSVSQ-QVSECGIPQLRKLKSSLLYAKRGVGDA 240
Query: 241 KMANMEHFASKGGPDKSLVLYLGFFIGVIYSFISNNREVHELKELLKQTEYLVQDLQEEL 300
K + G D LVL +G IG++ SF++N E+++++ KQTE L ++L++++
Sbjct: 241 KSVSRRSDNGTGSNDPVLVLCVGISIGIMSSFVANQTELNKVRAESKQTENLGKELEDDI 300
Query: 301 EMKDSLKLKELSNDNCESYAYSNAFSEKTGDESSPKHLMDYTLDFNAEELYEHKAEESSE 360
+ E+ + K E+SE
Sbjct: 301 H--------------------------------------------DGEKQCDEKTAENSE 360
Query: 361 SMSRIEAELEAELERLGLNINTGGTVRFPDNEEEVMIQKANHDVQLDPEFEEDYAEGELR 420
S+S+IEAELEAELERL +N+ + + K + +L+P+FE ++A+GELR
Sbjct: 361 SISKIEAELEAELERLEINMISSN-----------IETKLSDVFELEPDFEVEFAQGELR 391
Query: 421 NEMISEQSSGWTKHNEE-ASNSTVQSGNYTVPPRELSLRLHDVIQSRLEARIKELENALQ 476
++ + Q T N+E +SNST +SGNY V PRELSLRL VI S E RIKELENALQ
Sbjct: 421 DDQVERQRFDETVSNQERSSNSTPESGNYIVSPRELSLRLLGVINSCYEKRIKELENALQ 391
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038895324.1 | 9.8e-219 | 82.28 | uncharacterized protein LOC120083576 [Benincasa hispida]
XP_008449909.1 | 1.6e-200 | 77.64 | PREDICTED: uncharacterized protein LOC103491621 [Cucumis melo] >TYJ98934.1 peric...
XP_011650179.2 | 8.0e-197 | 77.05 | uncharacterized protein LOC105434747 [Cucumis sativus] >KGN64051.1 hypothetical ...
XP_022923688.1 | 3.4e-179 | 70.92 | uncharacterized protein LOC111431325 isoform X1 [Cucurbita moschata] >XP_0229236...
KAG6584411.1 | 2.2e-178 | 70.67 | hypothetical protein SDJN03_20343, partial [Cucurbita argyrosperma subsp. sorori...
Match Name | E-value | Identity (%) | Description
A0A5D3BIP8 | 7.6e-201 | 77.64 | Pericentriolar material 1 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3BN48 | 7.6e-201 | 77.64 | uncharacterized protein LOC103491621 OS=Cucumis melo OX=3656 GN=LOC103491621 PE=...
A0A0A0LVL9 | 3.9e-197 | 77.05 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G039170 PE=4 SV=1
A0A6J1E743 | 1.6e-179 | 70.92 | uncharacterized protein LOC111431325 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1ECL1 | 4.0e-178 | 70.67 | uncharacterized protein LOC111431325 isoform X2 OS=Cucurbita moschata OX=3662 GN...
Match Name | E-value | Identity (%) | Description
AT5G61040.1 | 1.0e-40 | 30.43 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G08010.1 | 4.8e-30 | 29.24 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
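The pipe-delimited summary rows above are straightforward to load for filtering or sorting. The sketch below is one possible way to do this in Python; the `Hit` type, the `parse_hit` function, and the 70% identity cutoff are illustrative assumptions, not part of the source page. Only the first four fields of each row are read, so any trailing columns are ignored.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity
    description: str


def parse_hit(row: str) -> Hit:
    """Parse one 'Match Name | E-value | Identity | Description' row."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return Hit(name, float(evalue), float(identity), description)


# Two rows copied from the summary tables.
rows = [
    "XP_038895324.1 | 9.8e-219 | 82.28 | uncharacterized protein LOC120083576 [Benincasa hispida]",
    "AT5G61040.1 | 1.0e-40 | 30.43 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...",
]

hits = sorted((parse_hit(r) for r in rows), key=lambda h: h.evalue)
close_homologs = [h.name for h in hits if h.identity >= 70.0]
print(close_homologs)  # ['XP_038895324.1']
```

Sorting by E-value puts the most significant hit first; the identity cutoff is arbitrary and would need to be chosen for the analysis at hand.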