Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
AAAATCAAATCAATGGCCATCCAAAAGCCACACCCAAGAAAATCAATCAATTCAATTCTCTCTCAAAAGCTCTTCATCTTCCTCCTCTCCACCTCCGCCGTCCTCCTCACCCTCTTCCACATCCGATCCCTCCACGCCGCCGCACCCTCCGCCGCCGAGCAGCTCCGGCGCTCCGTCACGTTTCTGCCCCTCAAGGACTTGCGTTACTCCCACAGAGCTCTCGAGGGGCACACGTGGTTCATGAGCTCCATGTACGACACCCACGAGGAGGGGGAGGTTCAGTTCCAGCAGTTCCCTTCGCCGGCCTCCGACGGTCGCCTCCTCTGCCTCCGCGGCCGCGACGCCCACGACGGCTCCTGGAACTACTACGCCGTCGCCTGGCCGGAAGCTCTGCCGGAAAACGCCACGTTTAGAAAAGGCTTGACCTTTGTTTCTTACAACCATTACGATTATGGGAACATCTGGCATGGCTTGTCCGCTCTCATGCCCTTCGTTGCTTGGCACCAGATTCAAGGTAAACGCCACTTTTCCAACGTTGCTTTTCGATTATTCCATTCATTAGTTCTTTTTTTAGTTTAATTAATTACTCCATCCTCTGGTGTTTTTTCCGTATTTTATTTTCAAATTGTTTTACCTAGTAGATAGCTCGAATTATCTCGTCTCATATGTGTTGTACTAAAAAAAATTGTATCTAATTAAATATTTACAGATAATTTTATTGGTTTAACATTTAGCACGGTGCTCAATCTAGAGTATAACTTAGTAAATAAGTCATTATTATCCTCCTTAAGATTAATGATTCGATTTCCTACTTCCAACTGTTAAATTAAAAAACAAAAATTTATGTGATACGTAATGATTACGTCTAAAATTAGATGTTATGTCTTGAAAGTATAAAACTAAACTGGCACAAATTTGAGAGTTTTAAAAGACCAAAATTGTAGTCTAACCCGCAAAACTTAGATTTTGTTATTTAAAATTAAATTAAATTACATCTAAACTTTTAGAGTTTTGTTTAATAATAAGTTCATTAACTTTATCTAATTGTGTCCTAACATTTAAATATAGTGTCATAAGTGTTTATTACAGCTTCATAAACTTTCATATTGCATCTAACAAAGTAAGATATTTTTCAAAATCTAAAAAAAGAAAAAAAAGAGAGGGTACCCTATATAAAAGTAGAAAAGTAGAAACAGAAAATGAAGAAAAAAGAAACGTTTCACCTAAGAAAAACCAAAATTTCGTCACATGATTTTCAGTGGTTGATTTGTTTTTTTATTCATGGTTTAGGAAATTGCGAGGCTCCGGAGAGATGGATCTTGTACCATTGGGGGGAGGTGAGGGTGTGGATGGCGACATGGCTGATGACGTTAATGGAGGCCACCTTCGGCCGGCCACCGGAGATCGAGGCCTTCCATGGCGTCGGCGAGGGGCAGGCAGTGTGCTTCGAGAAGGCAGTGGTGATGAGGCACAACGAGGGAGGAATGTCCCGACAGCGGAGAATGGAAACCTACGACTTGATGAGATGCAAGGCCAGGTTGTTCTGCAACGTCACCTCGCCCGAGCCATCACCGGCGGTGGGGATGACACTGCTCATGAGAACAGGGGCGAGGTCGTTTAGGAACGAAACTGCGGTGGTCGAGATATTTGGAGAGGAATGTGATAAAGTCCCCGGTTGCCACCTTACTGTAACTCACTCGAATAATCTAACCTTTTGTGATCAGGTATTCACTCGAGCTATATTTGAATTAGAATTAGAATTTTTTTTTTCAAAAAAATAAGTACCCAAGTATAGCTCATTAGTTATTGACATCTACTTCAAGTTTGGAAAGGGAAAAAAAAACTTTAACAAGTCGTAGATGATGCAATATTTATTTTATATAGTTAGAATCTAAGACACAGACATGGATATATACAATACGACATGGATACGGTGACACGTTATTTTCTAAAAATCTAAGACACCAGACATAGCAAGGACAGGTTTGTTAAAATATACA
ATTTTTTAGGAAGAAAATTCAAAGTCAATAAGCTTATGCATTTTTATGTATTAAAAAAACGAGTTTGATGAATTTTACTCTCAAAGTTTATTATTTTGTCCAATACTATTATTTATGTCTATTTCTTATGTTGAGTAAGTGTGTTTTATATGTGATTTTGAATCCATATTCACTTTTTACAACTAGTACGTGTGACATGCGTCTATTAAACTATTAAGTATCCTATAAGTATCCATATGTAAGAGTGTCAAACACGACACGTACCCAAAATGAAGTGTCCGTGTGTTAGAAGTGCATAAAAAAAAAATTAATTAAAACTATGGTTTCATATGATCATACTTTTAAGTTTGCTTTCATTTATGCTGATGAAAATGCAAATGCAAATGAAAAAAACAGGTGAGTTTGATGGGGAAGACAGACATATTGGTATCCCCACATGGAGCACAGTTGACAAACCTGTTTCTAATGGACAGAAACAGCAGTGTTATGGAGTTTTTCCCCAAAGGATGGCTCAAGCTTGCAGGCATTGGCCAGTTCGTGTTCCGTTGGCTCGCAAGCTGGTCTGGAATGAGGCATCAAGGTGATTGGAGAGACCCTCATGGCCCCGCCTGTCCCTATCCCGAACACGACCGTCGTTGCATGTCCGTTTATAAAAGTGGCACCATCGGGTACGCTTCTATTTTAGAATCCCATTTTCATCTCTAAACTTTTACCCGTTACCTGAGCTTCGAATGAAAGATTATTATATGCATTTATATTGAAATGTGAGTATACGTCTAGTTTGTCTATAAATTTGAATATATTTACGATTATAATAAAAATAGAGAATAAAATAGAAAATTTTTCGTATATTTATTTTTGTTGTTGATGTTAAACAGATACAATAGAACACAGTTTTCTGAGTGGGCTAAGAATGTTCTGAATGAGGTGAAGATGAGAAAGATGGAAGAAGCAGCACAGGGCTCTGCAAATCATGTTCATGAATGTTTTTGTAAC
mRNA sequence
AAAATCAAATCAATGGCCATCCAAAAGCCACACCCAAGAAAATCAATCAATTCAATTCTCTCTCAAAAGCTCTTCATCTTCCTCCTCTCCACCTCCGCCGTCCTCCTCACCCTCTTCCACATCCGATCCCTCCACGCCGCCGCACCCTCCGCCGCCGAGCAGCTCCGGCGCTCCGTCACGTTTCTGCCCCTCAAGGACTTGCGTTACTCCCACAGAGCTCTCGAGGGGCACACGTGGTTCATGAGCTCCATGTACGACACCCACGAGGAGGGGGAGGTTCAGTTCCAGCAGTTCCCTTCGCCGGCCTCCGACGGTCGCCTCCTCTGCCTCCGCGGCCGCGACGCCCACGACGGCTCCTGGAACTACTACGCCGTCGCCTGGCCGGAAGCTCTGCCGGAAAACGCCACGTTTAGAAAAGGCTTGACCTTTGTTTCTTACAACCATTACGATTATGGGAACATCTGGCATGGCTTGTCCGCTCTCATGCCCTTCGTTGCTTGGCACCAGATTCAAGGAAATTGCGAGGCTCCGGAGAGATGGATCTTGTACCATTGGGGGGAGGTGAGGGTGTGGATGGCGACATGGCTGATGACGTTAATGGAGGCCACCTTCGGCCGGCCACCGGAGATCGAGGCCTTCCATGGCGTCGGCGAGGGGCAGGCAGTGTGCTTCGAGAAGGCAGTGGTGATGAGGCACAACGAGGGAGGAATGTCCCGACAGCGGAGAATGGAAACCTACGACTTGATGAGATGCAAGGCCAGGTTGTTCTGCAACGTCACCTCGCCCGAGCCATCACCGGCGGTGGGGATGACACTGCTCATGAGAACAGGGGCGAGGTCGTTTAGGAACGAAACTGCGGTGGTCGAGATATTTGGAGAGGAATGTGATAAAGTCCCCGGTTGCCACCTTACTGTAACTCACTCGAATAATCTAACCTTTTGTGATCAGGTGAGTTTGATGGGGAAGACAGACATATTGGTATCCCCACATGGAGCACAGTTGACAAACCTGTTTCTAATGGACAGAAACAGCAGTGTTATGGAGTTTTTCCCCAAAGGATGGCTCAAGCTTGCAGGCATTGGCCAGTTCGTGTTCCGTTGGCTCGCAAGCTGGTCTGGAATGAGGCATCAAGGTGATTGGAGAGACCCTCATGGCCCCGCCTGTCCCTATCCCGAACACGACCGTCGTTGCATGTCCGTTTATAAAAGTGGCACCATCGGATACAATAGAACACAGTTTTCTGAGTGGGCTAAGAATGTTCTGAATGAGGTGAAGATGAGAAAGATGGAAGAAGCAGCACAGGGCTCTGCAAATCATGTTCATGAATGTTTTTGTAAC
Coding sequence (CDS)
AAAATCAAATCAATGGCCATCCAAAAGCCACACCCAAGAAAATCAATCAATTCAATTCTCTCTCAAAAGCTCTTCATCTTCCTCCTCTCCACCTCCGCCGTCCTCCTCACCCTCTTCCACATCCGATCCCTCCACGCCGCCGCACCCTCCGCCGCCGAGCAGCTCCGGCGCTCCGTCACGTTTCTGCCCCTCAAGGACTTGCGTTACTCCCACAGAGCTCTCGAGGGGCACACGTGGTTCATGAGCTCCATGTACGACACCCACGAGGAGGGGGAGGTTCAGTTCCAGCAGTTCCCTTCGCCGGCCTCCGACGGTCGCCTCCTCTGCCTCCGCGGCCGCGACGCCCACGACGGCTCCTGGAACTACTACGCCGTCGCCTGGCCGGAAGCTCTGCCGGAAAACGCCACGTTTAGAAAAGGCTTGACCTTTGTTTCTTACAACCATTACGATTATGGGAACATCTGGCATGGCTTGTCCGCTCTCATGCCCTTCGTTGCTTGGCACCAGATTCAAGGAAATTGCGAGGCTCCGGAGAGATGGATCTTGTACCATTGGGGGGAGGTGAGGGTGTGGATGGCGACATGGCTGATGACGTTAATGGAGGCCACCTTCGGCCGGCCACCGGAGATCGAGGCCTTCCATGGCGTCGGCGAGGGGCAGGCAGTGTGCTTCGAGAAGGCAGTGGTGATGAGGCACAACGAGGGAGGAATGTCCCGACAGCGGAGAATGGAAACCTACGACTTGATGAGATGCAAGGCCAGGTTGTTCTGCAACGTCACCTCGCCCGAGCCATCACCGGCGGTGGGGATGACACTGCTCATGAGAACAGGGGCGAGGTCGTTTAGGAACGAAACTGCGGTGGTCGAGATATTTGGAGAGGAATGTGATAAAGTCCCCGGTTGCCACCTTACTGTAACTCACTCGAATAATCTAACCTTTTGTGATCAGGTGAGTTTGATGGGGAAGACAGACATATTGGTATCCCCACATGGAGCACAGTTGACAAACCTGTTTCTAATGGACAGAAACAGCAGTGTTATGGAGTTTTTCCCCAAAGGATGGCTCAAGCTTGCAGGCATTGGCCAGTTCGTGTTCCGTTGGCTCGCAAGCTGGTCTGGAATGAGGCATCAAGGTGATTGGAGAGACCCTCATGGCCCCGCCTGTCCCTATCCCGAACACGACCGTCGTTGCATGTCCGTTTATAAAAGTGGCACCATCGGATACAATAGAACACAGTTTTCTGAGTGGGCTAAGAATGTTCTGAATGAGGTGAAGATGAGAAAGATGGAAGAAGCAGCACAGGGCTCTGCAAATCATGTTCATGAATGTTTTTGTAAC
Protein sequence
KIKSMAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPSAAEQLRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLCLRGRDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQGNCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRHNEGGMSRQRRMETYDLMRCKARLFCNVTSPEPSPAVGMTLLMRTGARSFRNETAVVEIFGEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFPKGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGYNRTQFSEWAKNVLNEVKMRKMEEAAQGSANHVHECFCN
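The protein sequence above appears to be a frame-0 translation of the coding sequence starting at its first base. A minimal Python sketch of that translation follows, assuming the standard genetic code; the function name `translate` and the table construction are illustrative and are not taken from the database that produced this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the page does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(nucleotides: str) -> str:
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    seq = nucleotides.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )

# First 27 bases of the coding sequence listed above.
print(translate("AAAATCAAATCAATGGCCATCCAAAAG"))  # KIKSMAIQK
```

Under this reading, the first methionine is the fifth residue, which is consistent with the top alignments below starting at query position 5.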
Homology
BLAST of MS000230 vs. NCBI nr
Match:
XP_022141026.1 (uncharacterized protein LOC111011532 [Momordica charantia])
HSP 1 Score: 924.5 bits (2388), Expect = 3.5e-265
Identity = 437/442 (98.87%), Positives = 439/442 (99.32%), Query Frame = 0
Query: 5 MAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPSAAEQLRRSVTFLPL 64
MAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPSAAEQLRRSVTFLPL
Sbjct: 1 MAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPSAAEQLRRSVTFLPL 60
Query: 65 KDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLCLRGRDAHDGSWNYYA 124
KDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLCL GRDAHDGSWNYYA
Sbjct: 61 KDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLCLGGRDAHDGSWNYYA 120
Query: 125 VAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQGNCEAPERWILYH 184
VAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQGNCEAPERWILYH
Sbjct: 121 VAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQGNCEAPERWILYH 180
Query: 185 WGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRHNEGGMSRQRRME 244
WGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRHNEGGMSRQRRME
Sbjct: 181 WGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRHNEGGMSRQRRME 240
Query: 245 TYDLMRCKARLFCNVTSPEPSPAVGMTLLMRTGARSFRNETAVVEIFGEECDKVPGCHLT 304
TYDLMRCKARLFCNVTSPEPSPAVGMTLLMRTGARSF+NETAVV+IFGEECDKVPGCHLT
Sbjct: 241 TYDLMRCKARLFCNVTSPEPSPAVGMTLLMRTGARSFKNETAVVKIFGEECDKVPGCHLT 300
Query: 305 VTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFPKGWLKLAGIGQFV 364
VTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFPKGWLKLAGIGQFV
Sbjct: 301 VTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFPKGWLKLAGIGQFV 360
Query: 365 FRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGYNRTQFSEWAKNVLNEV 424
FRWLASWSGM HQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGYNRTQFSEWAKNVLNEV
Sbjct: 361 FRWLASWSGMTHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGYNRTQFSEWAKNVLNEV 420
Query: 425 KMRKMEEAAQGSANHVHECFCN 447
KMRKMEEAA GSANHVHECFCN
Sbjct: 421 KMRKMEEAA-GSANHVHECFCN 441
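The percentages in an HSP header are the match counts divided by the alignment length. The short Python sketch below recomputes them from the header text; the helper name `hsp_percentages` is hypothetical, and the regular expression simply assumes each statistic is written as `matches/length`.

```python
import re

def hsp_percentages(stat_line: str) -> list[float]:
    """Return one percentage per 'matches/length' pair found in the line."""
    pairs = re.findall(r"(\d+)/(\d+)", stat_line)
    return [round(100 * int(num) / int(den), 2) for num, den in pairs]

# Header of the first HSP above: identical and positive-scoring positions.
line = "Identity = 437/442 (98.87%), Positives = 439/442 (99.32%)"
print(hsp_percentages(line))  # [98.87, 99.32]
```

The same check can be applied to any of the HSP headers in this report.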
BLAST of MS000230 vs. NCBI nr
Match:
XP_023541716.1 (uncharacterized protein LOC111801789 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 754.2 bits (1946), Expect = 6.3e-214
Identity = 357/458 (77.95%), Positives = 391/458 (85.37%), Query Frame = 0
Query: 2 IKSMAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLH----------AAAPSA 61
+K KPH R + + + S KLFI+LLS SAVL FHI+SLH +++ SA
Sbjct: 2 VKPSHTSKPHQR-TTSILFSPKLFIYLLSISAVLFIFFHIQSLHRHVLPPPQNPSSSSSA 61
Query: 62 AEQLRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDG--RLLC 121
A +LRRSVTFLPLKDLRYSH+ LEGHTWFMSSMYD HE+GEVQFQQFPSPA+DG RLLC
Sbjct: 62 AAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDARLLC 121
Query: 122 LRGRDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQ 181
L+G D HDGSWNYYAVAWPE LPENAT KGL+FVSYNHY+Y NIWHGLSALMPFVAWHQ
Sbjct: 122 LKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFVAWHQ 181
Query: 182 IQGNCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVV 241
IQG CE PERWILYHWGE+R+ M TW+ T+ME TFG PP+IEAF G+GEGQ VCFEKAVV
Sbjct: 182 IQGKCEIPERWILYHWGELRLKMGTWVRTIMEVTFGGPPKIEAFEGIGEGQPVCFEKAVV 241
Query: 242 MRHNEGGMSRQRRMETYDLMRCKARLFCNVTSPEPS-PAVGMTLLMRTGARSFRNETAVV 301
MRHNEGGMSRQRRMETYDLMRCKARLFCN TSPEPS VGMTL MRTGARSF+NETAV+
Sbjct: 242 MRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPEPSVTTVGMTLFMRTGARSFKNETAVM 301
Query: 302 EIFGEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVME 361
EIFG EC KV GC L V HSNNLTFC+QVSLMGKTDILVSPHGAQLTN+FLMDRNSSVME
Sbjct: 302 EIFGAECAKVAGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNSSVME 361
Query: 362 FFPKGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGY 421
FFPKGWLKLAGIGQFV++W+ASWSGMRHQG WRDPHG CPY E DRRCMS++K GTIGY
Sbjct: 362 FFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPHGLTCPYNEDDRRCMSIFKGGTIGY 421
Query: 422 NRTQFSEWAKNVLNEVKMRKMEEAAQGSANHVHECFCN 447
NRT FSEWAKNVL+EVKMRKM+EAAQ + NHVHEC CN
Sbjct: 422 NRTYFSEWAKNVLDEVKMRKMDEAAQATTNHVHECSCN 458
BLAST of MS000230 vs. NCBI nr
Match:
XP_022984427.1 (uncharacterized protein LOC111482727 [Cucurbita maxima])
HSP 1 Score: 752.3 bits (1941), Expect = 2.4e-213
Identity = 358/459 (78.00%), Positives = 388/459 (84.53%), Query Frame = 0
Query: 2 IKSMAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAP-----------S 61
+K KPH R + + + S KLFI+LLS SAVL FHI+SLH P S
Sbjct: 2 VKPSHTSKPHQR-TTSILFSPKLFIYLLSISAVLFIFFHIQSLHRHVPPPPQNNPSSSSS 61
Query: 62 AAEQLRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDG--RLL 121
A +LRRSVTFLPLKDLRYSH+ LEGHTWFMSSMYD HE+GEVQFQQFPSPA+DG RLL
Sbjct: 62 AVAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDARLL 121
Query: 122 CLRGRDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWH 181
CL+G D HDGSWNYYAVAWPE LPENAT KGL+FVSYNHY+Y NIWHGLSALMPFVAWH
Sbjct: 122 CLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFVAWH 181
Query: 182 QIQGNCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAV 241
QIQG CE PERWILYHWGE+R+ M TW+ T+ME TFG PP+IEAF G+GEGQ VCFEKAV
Sbjct: 182 QIQGKCEIPERWILYHWGELRLKMGTWVRTIMEVTFGGPPKIEAFDGIGEGQPVCFEKAV 241
Query: 242 VMRHNEGGMSRQRRMETYDLMRCKARLFCNVTSPEPSPA-VGMTLLMRTGARSFRNETAV 301
VMRHNEGGMSRQRRMETYDLMRCKARLFCN TS EPS A VGMTL MRTGARSF+NETAV
Sbjct: 242 VMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSSEPSVATVGMTLFMRTGARSFKNETAV 301
Query: 302 VEIFGEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVM 361
VEIFG EC+KV GC L V HSNNLTFC+QVSLMGKTDILVSPHGAQLTN+FLMDRNSSVM
Sbjct: 302 VEIFGAECNKVTGCQLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNSSVM 361
Query: 362 EFFPKGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIG 421
EFFPKGWLKLAGIGQFV++W+ASWSGMRHQG WRDPHG CPY E DRRCMS++K GTIG
Sbjct: 362 EFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPHGLTCPYNEDDRRCMSIFKGGTIG 421
Query: 422 YNRTQFSEWAKNVLNEVKMRKMEEAAQGSANHVHECFCN 447
YNRT FSEWAKNVLNEVK+RKM EAA +ANHVHEC CN
Sbjct: 422 YNRTYFSEWAKNVLNEVKIRKMNEAAHATANHVHECSCN 459
BLAST of MS000230 vs. NCBI nr
Match:
XP_022942991.1 (uncharacterized protein LOC111447859 [Cucurbita moschata])
HSP 1 Score: 748.4 bits (1931), Expect = 3.5e-212
Identity = 355/459 (77.34%), Positives = 389/459 (84.75%), Query Frame = 0
Query: 2 IKSMAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAP-----------S 61
+K KPH R + + + S KLFI+LLS SA+L FHI+SLH P S
Sbjct: 2 VKPSHTSKPHQR-TTSILFSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSSSSS 61
Query: 62 AAEQLRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDG--RLL 121
+A +LRRSVTFLPLKDLRYSH+ LEGHTWFMSSMYD HE+GEVQFQQFPSPA+DG RLL
Sbjct: 62 SAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDARLL 121
Query: 122 CLRGRDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWH 181
CL+G D HDGSWNYYAVAWPE LPENAT KGL+FVSYNHY+Y NIWHGLSALMPFVAWH
Sbjct: 122 CLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFVAWH 181
Query: 182 QIQGNCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAV 241
QIQG CE PERWILYHWGE+R+ M TW+ T+ME TFG PP+IEAF G+ EGQ VCFEKAV
Sbjct: 182 QIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPVCFEKAV 241
Query: 242 VMRHNEGGMSRQRRMETYDLMRCKARLFCNVTSPEPSPA-VGMTLLMRTGARSFRNETAV 301
VMRHNEGGMSRQRRMETYDLMRCKARLFCN TSP+PS A VGMTL MRTGARSF+NETAV
Sbjct: 242 VMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSFKNETAV 301
Query: 302 VEIFGEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVM 361
VEIFG EC KV GC L V HSNNLTFC+QVSLMGKTDILVSPHGAQLTN+FLMDRNSSVM
Sbjct: 302 VEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNSSVM 361
Query: 362 EFFPKGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIG 421
EFFPKGWLKLAGIGQFV++W+ASWSGMRHQG WRDP+G CPY E DRRCMS++K GTIG
Sbjct: 362 EFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNGLTCPYNEDDRRCMSIFKGGTIG 421
Query: 422 YNRTQFSEWAKNVLNEVKMRKMEEAAQGSANHVHECFCN 447
YNRT FSEWAKNVLNEVK RKM+EAAQ +ANHVH+C CN
Sbjct: 422 YNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSCN 459
BLAST of MS000230 vs. NCBI nr
Match:
XP_011653390.1 (uncharacterized protein LOC101219216 [Cucumis sativus] >KGN53729.1 hypothetical protein Csa_014798 [Cucumis sativus])
HSP 1 Score: 706.8 bits (1823), Expect = 1.2e-199
Identity = 335/455 (73.63%), Positives = 379/455 (83.30%), Query Frame = 0
Query: 2 IKSMAIQKPHPR--KSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPS-----AAEQ 61
+K++ K R K+ N+++S KLF++LLS SA+L LFHI SLH P A +
Sbjct: 2 VKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAAK 61
Query: 62 LRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDG--RLLCLRG 121
LRRSVTFLPLKDLRYS++AL GHTWFMSS+YD EEGEVQ+QQFPSP DG R+LCL+G
Sbjct: 62 LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG 121
Query: 122 RDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQG 181
RD HDGSWNYY +AWPE LPENA +KG++FVSYNHYDY NIWHGLSALMPFVAWHQIQG
Sbjct: 122 RDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG 181
Query: 182 NCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRH 241
CE PERWILYHWGE+R+ M W+ TLMEATFG P + EAF + EGQ VCFEKAVVMRH
Sbjct: 182 KCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMRH 241
Query: 242 NEGGMSRQRRMETYDLMRCKARLFCNVTSPEP-SPAVGMTLLMRTGARSFRNETAVVEIF 301
NEGGMSRQRRMETYD MRCKARLFCN+TSPEP S AVGMT+LMRTG RSFRNET VVEIF
Sbjct: 242 NEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEIF 301
Query: 302 GEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFP 361
G+EC KV GC LTV +SNNLTFC+QVSLMGKTDIL+SPHGAQLTN+ LM+RNSSVMEFFP
Sbjct: 302 GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP 361
Query: 362 KGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGP-ACPYPEHDRRCMSVYKSGTIGYNR 421
KGWL+LAGIGQ+V+ WLASWSGMRHQG WRDP+ CPY DRRCMS+YK+GTIGYNR
Sbjct: 362 KGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYNR 421
Query: 422 TQFSEWAKNVLNEVKMRKMEEAAQGSANHVHECFC 446
T FSEWAK+VLNEVKMRKMEEA + + N +HEC C
Sbjct: 422 THFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 456
BLAST of MS000230 vs. ExPASy TrEMBL
Match:
A0A6J1CHS3 (uncharacterized protein LOC111011532 OS=Momordica charantia OX=3673 GN=LOC111011532 PE=4 SV=1)
HSP 1 Score: 924.5 bits (2388), Expect = 1.7e-265
Identity = 437/442 (98.87%), Positives = 439/442 (99.32%), Query Frame = 0
Query: 5 MAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPSAAEQLRRSVTFLPL 64
MAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPSAAEQLRRSVTFLPL
Sbjct: 1 MAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPSAAEQLRRSVTFLPL 60
Query: 65 KDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLCLRGRDAHDGSWNYYA 124
KDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLCL GRDAHDGSWNYYA
Sbjct: 61 KDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLCLGGRDAHDGSWNYYA 120
Query: 125 VAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQGNCEAPERWILYH 184
VAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQGNCEAPERWILYH
Sbjct: 121 VAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQGNCEAPERWILYH 180
Query: 185 WGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRHNEGGMSRQRRME 244
WGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRHNEGGMSRQRRME
Sbjct: 181 WGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRHNEGGMSRQRRME 240
Query: 245 TYDLMRCKARLFCNVTSPEPSPAVGMTLLMRTGARSFRNETAVVEIFGEECDKVPGCHLT 304
TYDLMRCKARLFCNVTSPEPSPAVGMTLLMRTGARSF+NETAVV+IFGEECDKVPGCHLT
Sbjct: 241 TYDLMRCKARLFCNVTSPEPSPAVGMTLLMRTGARSFKNETAVVKIFGEECDKVPGCHLT 300
Query: 305 VTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFPKGWLKLAGIGQFV 364
VTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFPKGWLKLAGIGQFV
Sbjct: 301 VTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFPKGWLKLAGIGQFV 360
Query: 365 FRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGYNRTQFSEWAKNVLNEV 424
FRWLASWSGM HQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGYNRTQFSEWAKNVLNEV
Sbjct: 361 FRWLASWSGMTHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGYNRTQFSEWAKNVLNEV 420
Query: 425 KMRKMEEAAQGSANHVHECFCN 447
KMRKMEEAA GSANHVHECFCN
Sbjct: 421 KMRKMEEAA-GSANHVHECFCN 441
BLAST of MS000230 vs. ExPASy TrEMBL
Match:
A0A6J1J255 (uncharacterized protein LOC111482727 OS=Cucurbita maxima OX=3661 GN=LOC111482727 PE=4 SV=1)
HSP 1 Score: 752.3 bits (1941), Expect = 1.2e-213
Identity = 358/459 (78.00%), Positives = 388/459 (84.53%), Query Frame = 0
Query: 2 IKSMAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAP-----------S 61
+K KPH R + + + S KLFI+LLS SAVL FHI+SLH P S
Sbjct: 2 VKPSHTSKPHQR-TTSILFSPKLFIYLLSISAVLFIFFHIQSLHRHVPPPPQNNPSSSSS 61
Query: 62 AAEQLRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDG--RLL 121
A +LRRSVTFLPLKDLRYSH+ LEGHTWFMSSMYD HE+GEVQFQQFPSPA+DG RLL
Sbjct: 62 AVAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDARLL 121
Query: 122 CLRGRDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWH 181
CL+G D HDGSWNYYAVAWPE LPENAT KGL+FVSYNHY+Y NIWHGLSALMPFVAWH
Sbjct: 122 CLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFVAWH 181
Query: 182 QIQGNCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAV 241
QIQG CE PERWILYHWGE+R+ M TW+ T+ME TFG PP+IEAF G+GEGQ VCFEKAV
Sbjct: 182 QIQGKCEIPERWILYHWGELRLKMGTWVRTIMEVTFGGPPKIEAFDGIGEGQPVCFEKAV 241
Query: 242 VMRHNEGGMSRQRRMETYDLMRCKARLFCNVTSPEPSPA-VGMTLLMRTGARSFRNETAV 301
VMRHNEGGMSRQRRMETYDLMRCKARLFCN TS EPS A VGMTL MRTGARSF+NETAV
Sbjct: 242 VMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSSEPSVATVGMTLFMRTGARSFKNETAV 301
Query: 302 VEIFGEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVM 361
VEIFG EC+KV GC L V HSNNLTFC+QVSLMGKTDILVSPHGAQLTN+FLMDRNSSVM
Sbjct: 302 VEIFGAECNKVTGCQLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNSSVM 361
Query: 362 EFFPKGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIG 421
EFFPKGWLKLAGIGQFV++W+ASWSGMRHQG WRDPHG CPY E DRRCMS++K GTIG
Sbjct: 362 EFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPHGLTCPYNEDDRRCMSIFKGGTIG 421
Query: 422 YNRTQFSEWAKNVLNEVKMRKMEEAAQGSANHVHECFCN 447
YNRT FSEWAKNVLNEVK+RKM EAA +ANHVHEC CN
Sbjct: 422 YNRTYFSEWAKNVLNEVKIRKMNEAAHATANHVHECSCN 459
BLAST of MS000230 vs. ExPASy TrEMBL
Match:
A0A6J1FQH3 (uncharacterized protein LOC111447859 OS=Cucurbita moschata OX=3662 GN=LOC111447859 PE=4 SV=1)
HSP 1 Score: 748.4 bits (1931), Expect = 1.7e-212
Identity = 355/459 (77.34%), Positives = 389/459 (84.75%), Query Frame = 0
Query: 2 IKSMAIQKPHPRKSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAP-----------S 61
+K KPH R + + + S KLFI+LLS SA+L FHI+SLH P S
Sbjct: 2 VKPSHTSKPHQR-TTSILFSPKLFIYLLSISAILFIFFHIQSLHRHVPPRPQNNPSSSSS 61
Query: 62 AAEQLRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDG--RLL 121
+A +LRRSVTFLPLKDLRYSH+ LEGHTWFMSSMYD HE+GEVQFQQFPSPA+DG RLL
Sbjct: 62 SAAKLRRSVTFLPLKDLRYSHKPLEGHTWFMSSMYDIHEDGEVQFQQFPSPAADGDARLL 121
Query: 122 CLRGRDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWH 181
CL+G D HDGSWNYYAVAWPE LPENAT KGL+FVSYNHY+Y NIWHGLSALMPFVAWH
Sbjct: 122 CLKGNDTHDGSWNYYAVAWPETLPENATVMKGLSFVSYNHYNYDNIWHGLSALMPFVAWH 181
Query: 182 QIQGNCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAV 241
QIQG CE PERWILYHWGE+R+ M TW+ T+ME TFG PP+IEAF G+ EGQ VCFEKAV
Sbjct: 182 QIQGKCEIPERWILYHWGELRLKMGTWVSTIMEVTFGGPPKIEAFDGISEGQPVCFEKAV 241
Query: 242 VMRHNEGGMSRQRRMETYDLMRCKARLFCNVTSPEPSPA-VGMTLLMRTGARSFRNETAV 301
VMRHNEGGMSRQRRMETYDLMRCKARLFCN TSP+PS A VGMTL MRTGARSF+NETAV
Sbjct: 242 VMRHNEGGMSRQRRMETYDLMRCKARLFCNFTSPKPSVATVGMTLFMRTGARSFKNETAV 301
Query: 302 VEIFGEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVM 361
VEIFG EC KV GC L V HSNNLTFC+QVSLMGKTDILVSPHGAQLTN+FLMDRNSSVM
Sbjct: 302 VEIFGAECTKVVGCRLRVAHSNNLTFCEQVSLMGKTDILVSPHGAQLTNMFLMDRNSSVM 361
Query: 362 EFFPKGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIG 421
EFFPKGWLKLAGIGQFV++W+ASWSGMRHQG WRDP+G CPY E DRRCMS++K GTIG
Sbjct: 362 EFFPKGWLKLAGIGQFVYQWMASWSGMRHQGAWRDPNGLTCPYNEDDRRCMSIFKGGTIG 421
Query: 422 YNRTQFSEWAKNVLNEVKMRKMEEAAQGSANHVHECFCN 447
YNRT FSEWAKNVLNEVK RKM+EAAQ +ANHVH+C CN
Sbjct: 422 YNRTYFSEWAKNVLNEVKTRKMDEAAQATANHVHQCSCN 459
BLAST of MS000230 vs. ExPASy TrEMBL
Match:
A0A0A0KXZ9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G112630 PE=4 SV=1)
HSP 1 Score: 706.8 bits (1823), Expect = 5.6e-200
Identity = 335/455 (73.63%), Positives = 379/455 (83.30%), Query Frame = 0
Query: 2 IKSMAIQKPHPR--KSINSILSQKLFIFLLSTSAVLLTLFHIRSLHAAAPS-----AAEQ 61
+K++ K R K+ N+++S KLF++LLS SA+L LFHI SLH P A +
Sbjct: 2 VKALQQSKSQSRTTKTTNNLVSPKLFLYLLSISALLFILFHIHSLHHHVPPPPSSIVAAK 61
Query: 62 LRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDG--RLLCLRG 121
LRRSVTFLPLKDLRYS++AL GHTWFMSS+YD EEGEVQ+QQFPSP DG R+LCL+G
Sbjct: 62 LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG 121
Query: 122 RDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQG 181
RD HDGSWNYY +AWPE LPENA +KG++FVSYNHYDY NIWHGLSALMPFVAWHQIQG
Sbjct: 122 RDTHDGSWNYYGLAWPEGLPENARVKKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG 181
Query: 182 NCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRH 241
CE PERWILYHWGE+R+ M W+ TLMEATFG P + EAF + EGQ VCFEKAVVMRH
Sbjct: 182 KCEVPERWILYHWGELRLRMGKWVSTLMEATFGAPLQFEAFEDISEGQPVCFEKAVVMRH 241
Query: 242 NEGGMSRQRRMETYDLMRCKARLFCNVTSPEP-SPAVGMTLLMRTGARSFRNETAVVEIF 301
NEGGMSRQRRMETYD MRCKARLFCN+TSPEP S AVGMT+LMRTG RSFRNET VVEIF
Sbjct: 242 NEGGMSRQRRMETYDFMRCKARLFCNLTSPEPLSAAVGMTMLMRTGPRSFRNETTVVEIF 301
Query: 302 GEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFP 361
G+EC KV GC LTV +SNNLTFC+QVSLMGKTDIL+SPHGAQLTN+ LM+RNSSVMEFFP
Sbjct: 302 GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP 361
Query: 362 KGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGP-ACPYPEHDRRCMSVYKSGTIGYNR 421
KGWL+LAGIGQ+V+ WLASWSGMRHQG WRDP+ CPY DRRCMS+YK+GTIGYNR
Sbjct: 362 KGWLELAGIGQYVYHWLASWSGMRHQGAWRDPNSTLPCPYSPGDRRCMSIYKAGTIGYNR 421
Query: 422 TQFSEWAKNVLNEVKMRKMEEAAQGSANHVHECFC 446
T FSEWAK+VLNEVKMRKMEEA + + N +HEC C
Sbjct: 422 THFSEWAKSVLNEVKMRKMEEATKVTTNQIHECSC 456
BLAST of MS000230 vs. ExPASy TrEMBL
Match:
A0A5D3CB36 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold544G00050 PE=4 SV=1)
HSP 1 Score: 685.3 bits (1767), Expect = 1.8e-193
Identity = 328/448 (73.21%), Positives = 369/448 (82.37%), Query Frame = 0
Query: 2 IKSMAIQKPHPR--KSINSILSQKLFIFLLSTSAVLLTLFHIRSLH-----AAAPSAAEQ 61
+K + K R K+ N+++ KLF++LLS SA+L LFHI SLH S +
Sbjct: 3 VKGLQQSKSQSRATKTTNNLVCPKLFLYLLSISALLSILFHIHSLHHHVLPPPPSSIVAK 62
Query: 62 LRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDG--RLLCLRG 121
LRRSVTFLPLKDLRYS++AL GHTWFMSS+YD EEGEVQ+QQFPSP DG R+LCL+G
Sbjct: 63 LRRSVTFLPLKDLRYSNKALVGHTWFMSSLYDIQEEGEVQYQQFPSPVVDGDERMLCLKG 122
Query: 122 RDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQG 181
RD HDGSWNYY +AWPE LPENAT KG++FVSYNHYDY NIWHGLSALMPFVAWHQIQG
Sbjct: 123 RDTHDGSWNYYGLAWPEGLPENATVMKGVSFVSYNHYDYQNIWHGLSALMPFVAWHQIQG 182
Query: 182 NCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRH 241
CE PERWILYHWGE+R+ M W+ TLMEATFG P IEAF G+ EGQ VCFEKAVVMRH
Sbjct: 183 KCEVPERWILYHWGELRLRMGKWVNTLMEATFGAPIRIEAFEGISEGQPVCFEKAVVMRH 242
Query: 242 NEGGMSRQRRMETYDLMRCKARLFCNVTSPEP-SPAVGMTLLMRTGARSFRNETAVVEIF 301
NEGGMSRQRRMETYD MRCKARL CN+TSPEP S AVGMT+LMRTG RSFRNET V EIF
Sbjct: 243 NEGGMSRQRRMETYDFMRCKARLLCNLTSPEPLSGAVGMTMLMRTGPRSFRNETTVAEIF 302
Query: 302 GEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFP 361
G+EC KV GC LTV +SNNLTFC+QVSLMGKTDIL+SPHGAQLTN+ LM+RNSSVMEFFP
Sbjct: 303 GKECAKVAGCRLTVAYSNNLTFCEQVSLMGKTDILISPHGAQLTNMILMNRNSSVMEFFP 362
Query: 362 KGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGP-ACPYPEHDRRCMSVYKSGTIGYNR 421
KGWL+LAGIGQ+V+ WLASWSGM+HQG WRDP+ CPY +DRRCMS YK GTIGYNR
Sbjct: 363 KGWLELAGIGQYVYHWLASWSGMKHQGAWRDPNSTLPCPYSPNDRRCMSFYKGGTIGYNR 422
Query: 422 TQFSEWAKNVLNEVKMRKMEEAAQGSAN 439
T FSEWAK+VLNEVKMRK+EEA + + N
Sbjct: 423 TYFSEWAKSVLNEVKMRKIEEATKFTTN 450
BLAST of MS000230 vs. TAIR 10
Match:
AT4G33590.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33600.1); Has 126 Blast hits to 126 proteins in 35 species: Archae - 0; Bacteria - 12; Metazoa - 0; Fungi - 21; Plants - 62; Viruses - 0; Other Eukaryotes - 31 (source: NCBI BLink). )
HSP 1 Score: 558.9 bits (1439), Expect = 3.6e-159
Identity = 249/391 (63.68%), Positives = 315/391 (80.56%), Query Frame = 0
Query: 50 SAAEQLRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLC 109
S E+LR SVTFLPLKD R+S++ LEGHTWFMSS+YD +GE Q+Q+FPS +S GRLLC
Sbjct: 71 SLVEKLRESVTFLPLKDYRFSNKPLEGHTWFMSSLYDNQTKGEAQYQEFPSDSSKGRLLC 130
Query: 110 LRGRDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQ 169
L+G D HDGSWN YA+AWPEALP NA + GLTFVSYN YDYGN+WHGL+A++PF+AW
Sbjct: 131 LKGVDEHDGSWNSYALAWPEALPTNAILQDGLTFVSYNQYDYGNLWHGLTAVVPFIAW-S 190
Query: 170 IQGNCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVV 229
++ CE P++W+LYHWGE+R M WL ++ AT+G+ P+ F V + + VCFEKAVV
Sbjct: 191 LRNQCEKPQKWVLYHWGELRFGMGHWLSEIVTATYGQEPDFLRF--VDDDKPVCFEKAVV 250
Query: 230 MRHNEGGMSRQRRMETYDLMRCKARLFCNVTSPEPS-PAVGMTLLMRTGARSFRNETAVV 289
MRHNEGGMSR+RRME +DL+RCKAR +CN++S S P +GMTLL+RTGARSFRNE+ V+
Sbjct: 251 MRHNEGGMSRERRMEAFDLIRCKARNYCNISSSVASKPRIGMTLLLRTGARSFRNESMVI 310
Query: 290 EIFGEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVME 349
++F +EC +V GC ++V++SNNL+FC+QV LM KTD+LVSPHGAQLTNLFLMD+NSSVME
Sbjct: 311 DVFKKECKRVDGCEISVSYSNNLSFCEQVELMKKTDVLVSPHGAQLTNLFLMDKNSSVME 370
Query: 350 FFPKGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCMSVYKSGTIGY 409
FFPKGWLKLAG+GQ VF+W A+WSGMRH+G W DP G C +P+ DRRCMS+YK+ IGY
Sbjct: 371 FFPKGWLKLAGVGQLVFQWGANWSGMRHEGSWHDPVGEICQFPDTDRRCMSIYKNAMIGY 430
Query: 410 NRTQFSEWAKNVLNEVKMRKMEEAAQGSANH 440
N T F EWA+ VL + +R+M+E A+ NH
Sbjct: 431 NETYFGEWARRVLGKFSIREMKELAE--CNH 456
BLAST of MS000230 vs. TAIR 10
Match:
AT4G33600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4 anthesis, C globular stage, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33590.1); Has 131 Blast hits to 131 proteins in 40 species: Archae - 0; Bacteria - 9; Metazoa - 12; Fungi - 24; Plants - 58; Viruses - 0; Other Eukaryotes - 28 (source: NCBI BLink). )
HSP 1 Score: 552.0 bits (1421), Expect = 4.5e-157
Identity = 254/397 (63.98%), Positives = 311/397 (78.34%), Query Frame = 0
Query: 53 EQLRRSVTFLPLKDLRYSHRALEGHTWFMSSMYDTHEEGEVQFQQFPSPASDGRLLCLRG 112
E+LR SVTFLPLKDLR+S++ LEGHTWFMSS+YD +GEVQ+Q+FPS +S GRLLCL+G
Sbjct: 77 EKLRESVTFLPLKDLRFSNKPLEGHTWFMSSLYDNQTKGEVQYQEFPSESSKGRLLCLKG 136
Query: 113 RDAHDGSWNYYAVAWPEALPENATFRKGLTFVSYNHYDYGNIWHGLSALMPFVAWHQIQG 172
D HDGSWNYYA+AWP+ALP NA+ ++GLTFVSYNHYDYGN+WHGLSA++PFVAW ++
Sbjct: 137 VDEHDGSWNYYALAWPQALPVNASLQEGLTFVSYNHYDYGNMWHGLSAMVPFVAW-SLRH 196
Query: 173 NCEAPERWILYHWGEVRVWMATWLMTLMEATFGRPPEIEAFHGVGEGQAVCFEKAVVMRH 232
CE P+RW+LYHWGE+R M WL ++ AT+G+ E F + + VCFEKAVVMRH
Sbjct: 197 QCENPQRWVLYHWGELRFKMGNWLNEIITATYGQNTEFLRFR--DKNRPVCFEKAVVMRH 256
Query: 233 NEGGMSRQRRMETYDLMRCKARLFCNVTSPEPSPA-VGMTLLMRTGARSFRNETAVVEIF 292
NEGGMSR+RRME +DL+RCKAR +CN++ E S + +GMTLLMRTG RSF+NE+AV++IF
Sbjct: 257 NEGGMSRERRMEVFDLIRCKARHYCNISLSETSKSRIGMTLLMRTGPRSFKNESAVIDIF 316
Query: 293 GEECDKVPGCHLTVTHSNNLTFCDQVSLMGKTDILVSPHGAQLTNLFLMDRNSSVMEFFP 352
EC V GC L V++SNNLTFC+QV LM TD+LVSPHGAQLTNL LMDRNSSVMEF P
Sbjct: 317 KRECKNVEGCELKVSYSNNLTFCEQVELMRMTDVLVSPHGAQLTNLVLMDRNSSVMEFLP 376
Query: 353 KGWLKLAGIGQFVFRWLASWSGMRHQGDWRDPHGPACPYPEHDRRCM-SVYKSGTIGYNR 412
KGW KLAG+GQ V++W WSGMRH+G W DP G C +P+ DRRCM SVYK+G IGYN
Sbjct: 377 KGWRKLAGVGQLVYQWGTRWSGMRHEGSWHDPDGEICQFPDTDRRCMSSVYKNGRIGYNE 436
Query: 413 TQFSEWAKNVLNEVKMRKMEEAA--QGSANHVHECFC 446
T F EWAK+VL + K RKM + S + C+C
Sbjct: 437 TYFGEWAKSVLGKFKERKMANVVGRKHSYGSLDGCWC 470
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022141026.1 | 3.5e-265 | 98.87 | uncharacterized protein LOC111011532 [Momordica charantia] | [more] |
XP_023541716.1 | 6.3e-214 | 77.95 | uncharacterized protein LOC111801789 [Cucurbita pepo subsp. pepo] | [more] |
XP_022984427.1 | 2.4e-213 | 78.00 | uncharacterized protein LOC111482727 [Cucurbita maxima] | [more] |
XP_022942991.1 | 3.5e-212 | 77.34 | uncharacterized protein LOC111447859 [Cucurbita moschata] | [more] |
XP_011653390.1 | 1.2e-199 | 73.63 | uncharacterized protein LOC101219216 [Cucumis sativus] >KGN53729.1 hypothetical ... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CHS3 | 1.7e-265 | 98.87 | uncharacterized protein LOC111011532 OS=Momordica charantia OX=3673 GN=LOC111011... | [more] |
A0A6J1J255 | 1.2e-213 | 78.00 | uncharacterized protein LOC111482727 OS=Cucurbita maxima OX=3661 GN=LOC111482727... | [more] |
A0A6J1FQH3 | 1.7e-212 | 77.34 | uncharacterized protein LOC111447859 OS=Cucurbita moschata OX=3662 GN=LOC1114478... | [more] |
A0A0A0KXZ9 | 5.6e-200 | 73.63 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G112630 PE=4 SV=1 | [more] |
A0A5D3CB36 | 1.8e-193 | 73.21 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
Match Name | E-value | Identity | Description | |
AT4G33590.1 | 3.6e-159 | 63.68 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT4G33600.1 | 4.5e-157 | 63.98 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |