MS000966 (gene) Bitter gourd (TR) v1

Overview
Name: MS000966
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: flocculation protein FLO11
Location: scaffold36: 519642 .. 524418 (+)
RNA-Seq Expression: MS000966
Synteny: MS000966
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTCCTTCTCCAGCATTGAGGAGCTCTCCTGGTAGAGAGCTCAGAGGCAGCAATCATAAGCGAGGCCACAGCTTTGAGAGTGGCATGTGCATAAGAGAGAAAGATGACGATCTTGCTTTATTCAATGAAATGCAGACTCGAGAAAGAGAGAGTTTCCTGCTTCAGTCGGCAGAGGACTTGGAGGACTCATTTTGTAATTACGAGCTAACTCAATTTTATCATTTATATATTTTTTTGCTGCCAAATATAAAAACACGACCATTTAATTAGGCTGTCAATGTTGCACTTCTCATGAACTTTTTGATAGTTTTCTGCAGCTACAAAGTTGAGGCACTTTTCCGATATCAAGCTTGGTATCTCCATTCCTGTTCGTGGAGAGAATAGTGAATTGCTTAATGTAGATGGGGAGAAAAATGACTATGACTGGTGAGTGGCCTTATTCCTGTTCCAAGTGACTTACTCTTAAATCACCAATTGCTTATAATTATTAGGAGTGTGTGTATGTGTATGTGTGTGTTTGGTCCGTATAACTTCTTGCTAATTCATACTTGTGGCCGCTACCTATTTACTTTCTGCCGAAGTGGCTTGTGATAATTTTTGGTGGACTACTGTGCCGGTACTTGTTTATCATATGGTTGTTTGAAGCTGAATAATCAATCACCTTGCTCTTGGCTCTCACAAAATGTTCTGTATTATCCATCTCAAGGTTGTTAACACCTCCCGACACCCCTCTTTTCCCTTCATTGGACAATGATCCGCCTCCAGTTACTCTTGCAAGCAGGGGGAGACCTCGTAGTCAACCCATTTCCATATCACGATCATCCACGGTATGTGATTAAACTGTTTATATAGAATATAATTGTGTTGCGATCATGTTGGATAAGTCTCTAAAAGTGTCTTTTTACAATTGAGCTATTTGATGTTCTGTACTTCTTTGTTGAAACACTGAAGAGTTTTGTTTTCATAGATGGAAAAAAGTCACAGAAGCAGTACGAGTAGGGGTAGTGCAAGTCCTAATCGTTTAAGCCCATCGCCTAGGTCAGCAAGCAGTGTGCCTCAAATGCGGGGAAGGCAACTGTCAGCCCCGCACTCTAGCCCAACTCCAAGTCTACGACATGCCACACCATCTAGAAGATCAACTACCCCTGCAAGAAGATCATCACCTCCTCCAAGTACACCATCAATATCTGTGACAAGGTCTTCCACCCCAACTCCCAGGAGGTTGAGCACAGGGTCGAGTGGAACTTCTACCACATCTGGGGCAAGGGGAACTTCACCTATAAAGGCAGTAAGAGGAAATTCTGCTTCACCTAAAATAAGAGCATGGCAAACTAATATTCCTGGTTTCTCTTCTGATGCTCCCCCTAACCTTCGAACATCCCTGGCTGATCGGCCAGCATCATATACGAGAGGATCTTCACCAGCTTCTCGAAACAGTATGGACCTTCAATACAAGTACAGTAGGCAATCGATGTCTCCAACTGCTCCTATCAGCTCATCCCATAGTCACGATCGAGATCGCTATAGCTCTTACAGTAGAGGTTCGAAGGCCTCATCTGGTGATGATGATTTAGACTCCCTGCAGTCAATTCCTACTAGCAGTTTGGATAATTCATTGTCAAAAGGAGGGAATACATTTTCAAATAACAAAGCTCTAACCATATCAAAGAAACACAGAATAGTGTCTTCTACTTCCACTCCCAAAAGATCCCTCGATTCCACTATTCGACAATTGGTATGGATTTTATTTTTTAGTTTTTGAATGATATGATATTTTGTACTTTGGAACTCACATTTTGGTTAACATAAACAAGCTGTAAAGTTTGTTTTTTTTTCTCAGATTAGTTGTTACATTCACGTGACATACTTTTATTGGGACTGTTTGAATGTTAGTATTTAAAATTTAACATGGTTATTTTATGAACAATTTGAGAAACTGGAAAAAAGATTTTCAACTCTTTTAGTGACTTAGTGCGTAAGTTTTCTGCAAAGTACAA
TTGATTGTTTGTTGATCTTGTTATTCTCCCGAGTAAGTTCATGTTAGTAAACAGTTTAATGATGTGGAATTTGGCAGAAAAAAATATTTCCATTTCAATGTACTTTGTGAGTCTGACTCATCTTAGATAATCACATTTCTTCTTTACTTGTGTCAGTCATGGTGTGCTTTTATAGTTTCACTGGAGTTTACATGACAACTAATGGTGCATATGCACTTCCTTATTTTTCTCCAGGATCGAAAGAGCCCGAATATGTTTAGGCCACTTTTATCAAGTGTCCCCAGTACCACCTTTTATACTGGCAAGGTGAGTTCTGCCCATCGTTCTCTGATTTCCAGGAACTCCTCAGTCACAACTAGCAGTAATGCAAGTTCTTATAATGGTGCAAGTATTGTACTTGATACAGAAGTGAGTGACCTAAATCAGGATGATATGGCAAATGAATGTGAGAAAGTGCCATATCATGACATTCATGAAGAGATATTCGCTTTCGATAAGATGGATATTGTTAATGAAAATCCCATTAATGATATCAAGTCACTTGATAGTGGCCCTGCACCTGGTTGTGATCCTGTTCTCACTGAAGATAGCAGTCACCAAACTATTATCCCAGAAATAAGTTCCACTTTTGACTCTTCTCGTGCCCAGGGGAATGCTTTTTCAGAGGTTGTTTGCCTTGATGATATAATTTTGTGTCCCAGATGTGGTTGTAGGTATTGTGTCATTGACACAGAGGAAAATAACATTAATCTTTGTCCAGAATGCAGTAGGAAAGAAAAATACCTTGGCATGACCCTTTTGGAAAATATGACTTCAGTTACTGAAAGCATATCAGGGTATTCAATAAAGTATGAAGCAGGTAAGCCTTTCAATAAGGTGGAGTCAGGGGTGATTTCGCTTGAATCTTCCCTAGCTACTGATTTGGGTGAATCTAGGATTTCCGAGTCTCTGGGCAATGTTGAGCAAGATCAAGCATCTTATCCTGAAGAAGGCTTGAGTTACCAGAAAGAAAACTTCCCCTCAGAAACACCGGTGAGCGAAAGTCAGCATAGCCTTATCAACCATTCAGAGATAGGCCAATTAGTTGTTAGTGGCAGTCAGTCCAACACTGAATCTGGATATCAGCAACCTCTTCATCATAACGACTATAAAGATTTGAGGTTTGATTCATCAGAAGGTGCAGGTATTTCTATATTGTTAAAGAGATCGAGCAGCAGTAAGGGCCCTATTGTCCAAGGAAGAACTTTTACTACAAGTACCATATCTTATGATGATCTGTCTTTTGCAAGAGATAGCATGAGCAGTTTGAGAAGTTCTGTTGGACACAGCAGTTTTTCTGCATCATCATCGGCTGATTTCAGCTCAGCCAGACAGATTGAAGCCCGAATACAACGTCAGGTAAGTTCAAGGAAAGGGGAATTAGAGAGCAAAAAGGGTGAAATTTGCGTGAAATCTCATATCTCTGAGGCAGCTTCTTCTGGAACACCTACCAATGCTCATCCTGTATTAGGCTTTGAAACTTGTGAGCAAGAGGAAAATTTGGATTTTACTGTGGCCAATTTAGAATGTTTTTCCAGTCAGGGAACTACTAATTCTTCTCAGAAACCTGAACTAGCTTCTGAAAATGCTGAATCAGATGACACCTCTTCAATTGTGGTTGCTGTTGTAGAGGAGGATAAATTTGAATGCGACAACTGTAGAATACTGGACACTTGTACCTCGGAATCGTCAAGGTATGCCCTCAGCAACAATTTTGATTTTTTTTTTTGGGACATTCAATCCAAAATGGTTTATTATACTTAACATGTGAACAGGGAGGACTTATCAGGTGGTAGGAGTGTCTCAGATAAAGAAGCACCAGTTACAACTTCCGACTGTTCCAAATTGGAGGGACACAACATGCCGGATGTTAGTGCGTTTGAAGATGAACAACATCCCAATCATTTAATGACCACAATATCAGAAAAAGAAACGAAACAAATAGCTGAGGTGATAGCACC
TGGTTCACAGGGTGATTTATCAATAATATCAAAAAGTCTCCTGGAGGAGGAATCTATGGTTCCTAGTGGGCCTGACGAGGATTTAACACCACCTGTTATTAATACTGAAAAATCTTATGGTATCCTAGGTATGGAAGTTTCTTTGGTTGCGATTTATATGCTTTTCAGCTTTTGTTTAAAATCTCTCCTTGCATCTCTTTTAAGTTCCTAAATTTCTTTAATACTGTGATCTTGAGATTATTTGACTAATCCTGGGTTTTAATCATGTCCTCTTCTAAAATTACTTTAAACTTATTTTGATGAATTTTCACGTTGTCTCCGATCCCTGATAAAACTTTCTGTTTGGCAGAAGAATCAACAGTAATTGTCGATTACCAGGGCAGAAGAAAGGTGGTGAGAAGCTTGACACTTGAAGAGGCAACAGATACAATTCTTTTCTGCAGCTCTATTGTTCATGATCTAGCCTATTCAGCTGCTTCCATAGCAATTGAAAAGGAAAACGAGGTTACATTGGAAGGCTCACGGCCAACAGTTACCATTTTGGGAAAATCTAACACTGACAGAAGCGATCTACGCAGTAGAACAGGCGGCAAACGGGTTATGAAATCTCAAAAACTGAGACAACGGCATGTGGAAATGAGTACAAAGCCTCCTGTTACGAAGACAGAGAACGACGAGAACACCGACGAGTCCACCATTCGAAATGTAGGTCTTCCTAACCAAGTGGACAGCATGAAACCTCTAAAGCTGGAATCCAAATGCAACTGCAGTATAATG

mRNA sequence

ATGCCTCCTTCTCCAGCATTGAGGAGCTCTCCTGGTAGAGAGCTCAGAGGCAGCAATCATAAGCGAGGCCACAGCTTTGAGAGTGGCATGTGCATAAGAGAGAAAGATGACGATCTTGCTTTATTCAATGAAATGCAGACTCGAGAAAGAGAGAGTTTCCTGCTTCAGTCGGCAGAGGACTTGGAGGACTCATTTTCTACAAAGTTGAGGCACTTTTCCGATATCAAGCTTGGTATCTCCATTCCTGTTCGTGGAGAGAATAGTGAATTGCTTAATGTAGATGGGGAGAAAAATGACTATGACTGGTTGTTAACACCTCCCGACACCCCTCTTTTCCCTTCATTGGACAATGATCCGCCTCCAGTTACTCTTGCAAGCAGGGGGAGACCTCGTAGTCAACCCATTTCCATATCACGATCATCCACGATGGAAAAAAGTCACAGAAGCAGTACGAGTAGGGGTAGTGCAAGTCCTAATCGTTTAAGCCCATCGCCTAGGTCAGCAAGCAGTGTGCCTCAAATGCGGGGAAGGCAACTGTCAGCCCCGCACTCTAGCCCAACTCCAAGTCTACGACATGCCACACCATCTAGAAGATCAACTACCCCTGCAAGAAGATCATCACCTCCTCCAAGTACACCATCAATATCTGTGACAAGGTCTTCCACCCCAACTCCCAGGAGGTTGAGCACAGGGTCGAGTGGAACTTCTACCACATCTGGGGCAAGGGGAACTTCACCTATAAAGGCAGTAAGAGGAAATTCTGCTTCACCTAAAATAAGAGCATGGCAAACTAATATTCCTGGTTTCTCTTCTGATGCTCCCCCTAACCTTCGAACATCCCTGGCTGATCGGCCAGCATCATATACGAGAGGATCTTCACCAGCTTCTCGAAACAGTATGGACCTTCAATACAAGTACAGTAGGCAATCGATGTCTCCAACTGCTCCTATCAGCTCATCCCATAGTCACGATCGAGATCGCTATAGCTCTTACAGTAGAGGTTCGAAGGCCTCATCTGGTGATGATGATTTAGACTCCCTGCAGTCAATTCCTACTAGCAGTTTGGATAATTCATTGTCAAAAGGAGGGAATACATTTTCAAATAACAAAGCTCTAACCATATCAAAGAAACACAGAATAGTGTCTTCTACTTCCACTCCCAAAAGATCCCTCGATTCCACTATTCGACAATTGGTATGGATTTTATTTTTTATCATGGTGTGCTTTTATAGTTTCACTGGAGTTTACATGACAACTAATGGTGCATATGCACTTATTTTTCTCCAGGATCGAAAGAGCCCGAATATGTTTAGGCCACTTTTATCAAGTGTCCCCAGTACCACCTTTTATACTGGCAAGGTGAGTTCTGCCCATCGTTCTCTGATTTCCAGGAACTCCTCAGTCACAACTAGCAGTAATGCAAGTTCTTATAATGGTGCAAGTATTGTACTTGATACAGAAGTGAGTGACCTAAATCAGGATGATATGGCAAATGAATGTGAGAAAGTGCCATATCATGACATTCATGAAGAGATATTCGCTTTCGATAAGATGGATATTGTTAATGAAAATCCCATTAATGATATCAAGTCACTTGATAGTGGCCCTGCACCTGGTTGTGATCCTGTTCTCACTGAAGATAGCAGTCACCAAACTATTATCCCAGAAATAAGTTCCACTTTTGACTCTTCTCGTGCCCAGGGGAATGCTTTTTCAGAGGTTGTTTGCCTTGATGATATAATTTTGTGTCCCAGATGTGGTTGTAGGTATTGTGTCATTGACACAGAGGAAAATAACATTAATCTTTGTCCAGAATGCAGTAGGAAAGAAAAATACCTTGGCATGACCCTTTTGGAAAATATGACTTCAGTTACTGAAAGCATATCAGGGTATTCAATAAAGTATGAAGCAGGTAAGCCTTTCAATAAGGTGGAGTCAGGGGTGATTTCGCTTGAATCTTCCCTAGCTACTGATTTGGGTGAATCTAGGATTTCCGAGTC
TCTGGGCAATGTTGAGCAAGATCAAGCATCTTATCCTGAAGAAGGCTTGAGTTACCAGAAAGAAAACTTCCCCTCAGAAACACCGGTGAGCGAAAGTCAGCATAGCCTTATCAACCATTCAGAGATAGGCCAATTAGTTGTTAGTGGCAGTCAGTCCAACACTGAATCTGGATATCAGCAACCTCTTCATCATAACGACTATAAAGATTTGAGGTTTGATTCATCAGAAGGTGCAGGTATTTCTATATTGTTAAAGAGATCGAGCAGCAGTAAGGGCCCTATTGTCCAAGGAAGAACTTTTACTACAAGTACCATATCTTATGATGATCTGTCTTTTGCAAGAGATAGCATGAGCAGTTTGAGAAGTTCTGTTGGACACAGCAGTTTTTCTGCATCATCATCGGCTGATTTCAGCTCAGCCAGACAGATTGAAGCCCGAATACAACGTCAGGTAAGTTCAAGGAAAGGGGAATTAGAGAGCAAAAAGGGTGAAATTTGCGTGAAATCTCATATCTCTGAGGCAGCTTCTTCTGGAACACCTACCAATGCTCATCCTGTATTAGGCTTTGAAACTTGTGAGCAAGAGGAAAATTTGGATTTTACTGTGGCCAATTTAGAATGTTTTTCCAGTCAGGGAACTACTAATTCTTCTCAGAAACCTGAACTAGCTTCTGAAAATGCTGAATCAGATGACACCTCTTCAATTGTGGTTGCTGTTGTAGAGGAGGATAAATTTGAATGCGACAACTGTAGAATACTGGACACTTGTACCTCGGAATCGTCAAGGGAGGACTTATCAGGTGGTAGGAGTGTCTCAGATAAAGAAGCACCAGTTACAACTTCCGACTGTTCCAAATTGGAGGGACACAACATGCCGGATGTTAGTGCGTTTGAAGATGAACAACATCCCAATCATTTAATGACCACAATATCAGAAAAAGAAACGAAACAAATAGCTGAGGTGATAGCACCTGGTTCACAGGGTGATTTATCAATAATATCAAAAAGTCTCCTGGAGGAGGAATCTATGGTTCCTAGTGGGCCTGACGAGGATTTAACACCACCTGTTATTAATACTGAAAAATCTTATGGTATCCTAGAAGAATCAACAGTAATTGTCGATTACCAGGGCAGAAGAAAGGTGGTGAGAAGCTTGACACTTGAAGAGGCAACAGATACAATTCTTTTCTGCAGCTCTATTGTTCATGATCTAGCCTATTCAGCTGCTTCCATAGCAATTGAAAAGGAAAACGAGGTTACATTGGAAGGCTCACGGCCAACAGTTACCATTTTGGGAAAATCTAACACTGACAGAAGCGATCTACGCAGTAGAACAGGCGGCAAACGGGTTATGAAATCTCAAAAACTGAGACAACGGCATGTGGAAATGAGTACAAAGCCTCCTGTTACGAAGACAGAGAACGACGAGAACACCGACGAGTCCACCATTCGAAATGTAGGTCTTCCTAACCAAGTGGACAGCATGAAACCTCTAAAGCTGGAATCCAAATGCAACTGCAGTATAATG

Coding sequence (CDS)

ATGCCTCCTTCTCCAGCATTGAGGAGCTCTCCTGGTAGAGAGCTCAGAGGCAGCAATCATAAGCGAGGCCACAGCTTTGAGAGTGGCATGTGCATAAGAGAGAAAGATGACGATCTTGCTTTATTCAATGAAATGCAGACTCGAGAAAGAGAGAGTTTCCTGCTTCAGTCGGCAGAGGACTTGGAGGACTCATTTTCTACAAAGTTGAGGCACTTTTCCGATATCAAGCTTGGTATCTCCATTCCTGTTCGTGGAGAGAATAGTGAATTGCTTAATGTAGATGGGGAGAAAAATGACTATGACTGGTTGTTAACACCTCCCGACACCCCTCTTTTCCCTTCATTGGACAATGATCCGCCTCCAGTTACTCTTGCAAGCAGGGGGAGACCTCGTAGTCAACCCATTTCCATATCACGATCATCCACGATGGAAAAAAGTCACAGAAGCAGTACGAGTAGGGGTAGTGCAAGTCCTAATCGTTTAAGCCCATCGCCTAGGTCAGCAAGCAGTGTGCCTCAAATGCGGGGAAGGCAACTGTCAGCCCCGCACTCTAGCCCAACTCCAAGTCTACGACATGCCACACCATCTAGAAGATCAACTACCCCTGCAAGAAGATCATCACCTCCTCCAAGTACACCATCAATATCTGTGACAAGGTCTTCCACCCCAACTCCCAGGAGGTTGAGCACAGGGTCGAGTGGAACTTCTACCACATCTGGGGCAAGGGGAACTTCACCTATAAAGGCAGTAAGAGGAAATTCTGCTTCACCTAAAATAAGAGCATGGCAAACTAATATTCCTGGTTTCTCTTCTGATGCTCCCCCTAACCTTCGAACATCCCTGGCTGATCGGCCAGCATCATATACGAGAGGATCTTCACCAGCTTCTCGAAACAGTATGGACCTTCAATACAAGTACAGTAGGCAATCGATGTCTCCAACTGCTCCTATCAGCTCATCCCATAGTCACGATCGAGATCGCTATAGCTCTTACAGTAGAGGTTCGAAGGCCTCATCTGGTGATGATGATTTAGACTCCCTGCAGTCAATTCCTACTAGCAGTTTGGATAATTCATTGTCAAAAGGAGGGAATACATTTTCAAATAACAAAGCTCTAACCATATCAAAGAAACACAGAATAGTGTCTTCTACTTCCACTCCCAAAAGATCCCTCGATTCCACTATTCGACAATTGGTATGGATTTTATTTTTTATCATGGTGTGCTTTTATAGTTTCACTGGAGTTTACATGACAACTAATGGTGCATATGCACTTATTTTTCTCCAGGATCGAAAGAGCCCGAATATGTTTAGGCCACTTTTATCAAGTGTCCCCAGTACCACCTTTTATACTGGCAAGGTGAGTTCTGCCCATCGTTCTCTGATTTCCAGGAACTCCTCAGTCACAACTAGCAGTAATGCAAGTTCTTATAATGGTGCAAGTATTGTACTTGATACAGAAGTGAGTGACCTAAATCAGGATGATATGGCAAATGAATGTGAGAAAGTGCCATATCATGACATTCATGAAGAGATATTCGCTTTCGATAAGATGGATATTGTTAATGAAAATCCCATTAATGATATCAAGTCACTTGATAGTGGCCCTGCACCTGGTTGTGATCCTGTTCTCACTGAAGATAGCAGTCACCAAACTATTATCCCAGAAATAAGTTCCACTTTTGACTCTTCTCGTGCCCAGGGGAATGCTTTTTCAGAGGTTGTTTGCCTTGATGATATAATTTTGTGTCCCAGATGTGGTTGTAGGTATTGTGTCATTGACACAGAGGAAAATAACATTAATCTTTGTCCAGAATGCAGTAGGAAAGAAAAATACCTTGGCATGACCCTTTTGGAAAATATGACTTCAGTTACTGAAAGCATATCAGGGTATTCAATAAAGTATGAAGCAGGTAAGCCTTTCAATAAGGTGGAGTCAGGGGTGATTTCGCTTGAATCTTCCCTAGCTACTGATTTGGGTGAATCTAGGATTTCCGAGTC
TCTGGGCAATGTTGAGCAAGATCAAGCATCTTATCCTGAAGAAGGCTTGAGTTACCAGAAAGAAAACTTCCCCTCAGAAACACCGGTGAGCGAAAGTCAGCATAGCCTTATCAACCATTCAGAGATAGGCCAATTAGTTGTTAGTGGCAGTCAGTCCAACACTGAATCTGGATATCAGCAACCTCTTCATCATAACGACTATAAAGATTTGAGGTTTGATTCATCAGAAGGTGCAGGTATTTCTATATTGTTAAAGAGATCGAGCAGCAGTAAGGGCCCTATTGTCCAAGGAAGAACTTTTACTACAAGTACCATATCTTATGATGATCTGTCTTTTGCAAGAGATAGCATGAGCAGTTTGAGAAGTTCTGTTGGACACAGCAGTTTTTCTGCATCATCATCGGCTGATTTCAGCTCAGCCAGACAGATTGAAGCCCGAATACAACGTCAGGTAAGTTCAAGGAAAGGGGAATTAGAGAGCAAAAAGGGTGAAATTTGCGTGAAATCTCATATCTCTGAGGCAGCTTCTTCTGGAACACCTACCAATGCTCATCCTGTATTAGGCTTTGAAACTTGTGAGCAAGAGGAAAATTTGGATTTTACTGTGGCCAATTTAGAATGTTTTTCCAGTCAGGGAACTACTAATTCTTCTCAGAAACCTGAACTAGCTTCTGAAAATGCTGAATCAGATGACACCTCTTCAATTGTGGTTGCTGTTGTAGAGGAGGATAAATTTGAATGCGACAACTGTAGAATACTGGACACTTGTACCTCGGAATCGTCAAGGGAGGACTTATCAGGTGGTAGGAGTGTCTCAGATAAAGAAGCACCAGTTACAACTTCCGACTGTTCCAAATTGGAGGGACACAACATGCCGGATGTTAGTGCGTTTGAAGATGAACAACATCCCAATCATTTAATGACCACAATATCAGAAAAAGAAACGAAACAAATAGCTGAGGTGATAGCACCTGGTTCACAGGGTGATTTATCAATAATATCAAAAAGTCTCCTGGAGGAGGAATCTATGGTTCCTAGTGGGCCTGACGAGGATTTAACACCACCTGTTATTAATACTGAAAAATCTTATGGTATCCTAGAAGAATCAACAGTAATTGTCGATTACCAGGGCAGAAGAAAGGTGGTGAGAAGCTTGACACTTGAAGAGGCAACAGATACAATTCTTTTCTGCAGCTCTATTGTTCATGATCTAGCCTATTCAGCTGCTTCCATAGCAATTGAAAAGGAAAACGAGGTTACATTGGAAGGCTCACGGCCAACAGTTACCATTTTGGGAAAATCTAACACTGACAGAAGCGATCTACGCAGTAGAACAGGCGGCAAACGGGTTATGAAATCTCAAAAACTGAGACAACGGCATGTGGAAATGAGTACAAAGCCTCCTGTTACGAAGACAGAGAACGACGAGAACACCGACGAGTCCACCATTCGAAATGTAGGTCTTCCTAACCAAGTGGACAGCATGAAACCTCTAAAGCTGGAATCCAAATGCAACTGCAGTATAATG

Protein sequence

MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAEDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQYKYSRQSMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMTTNGAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENNINLCPECSRKEKYLGMTLLENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM
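
The protein sequence above can be checked against the coding sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE`, `translate` and `cds_start` are illustrative and not part of this record. Only the first 45 nucleotides of the CDS are used here.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 45 nt of the CDS listed above (illustrative excerpt only).
cds_start = "ATGCCTCCTTCTCCAGCATTGAGGAGCTCTCCTGGTAGAGAGCTC"
print(translate(cds_start))  # MPPSPALRSSPGREL
```

The printed peptide matches the first 15 residues of the protein sequence. Passing the full CDS to the same function should reproduce the whole polypeptide, provided the sequence is supplied as one contiguous string.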
Homology
BLAST of MS000966 vs. NCBI nr
Match: XP_022131331.1 (uncharacterized protein LOC111004588 isoform X1 [Momordica charantia] >XP_022131332.1 uncharacterized protein LOC111004588 isoform X1 [Momordica charantia])

HSP 1 Score: 2125.1 bits (5505), Expect = 0.0e+00
Identity = 1138/1176 (96.77%), Positives = 1139/1176 (96.85%), Query Frame = 0
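
The percentages on an HSP summary line are the matched count divided by the alignment length. The sketch below recomputes them from the line text; `SUMMARY` and `recompute` are hypothetical helpers written for illustration, not part of BLAST or of this portal.

```python
import re

# Hypothetical helper: pull the two "count/length" pairs out of an HSP
# summary line. The pattern tolerates spelling variants of the
# "Positives" label.
SUMMARY = re.compile(r"Identity = (\d+)/(\d+).*?Pos[a-z]*ives = (\d+)/(\d+)")

def recompute(line):
    """Return (identity %, positives %) rounded to two decimals."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError("not a BLAST HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 1138/1176 (96.77%), Positives = 1139/1176 (96.85%), Query Frame = 0"
print(recompute(line))  # (96.77, 96.85)
```

Note that the denominator is the alignment length including gap columns (1176 here), not the length of either protein.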

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALRSSPGREL GSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED
Sbjct: 3    MPPSPALRSSPGRELXGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 62

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 120
            LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP
Sbjct: 63   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 122

Query: 121  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 180
            PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS
Sbjct: 123  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 182

Query: 181  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 240
            APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG
Sbjct: 183  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 242

Query: 241  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 300
            ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM
Sbjct: 243  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 302

Query: 301  DLQYKYSRQSMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS 360
            DLQYKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS
Sbjct: 303  DLQYKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS 362

Query: 361  KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMTTN 420
            KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL                      
Sbjct: 363  KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL---------------------- 422

Query: 421  GAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA 480
                     DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA
Sbjct: 423  ---------DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA 482

Query: 481  SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC 540
            SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC
Sbjct: 483  SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC 542

Query: 541  DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENNI 600
            DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEEN I
Sbjct: 543  DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENXI 602

Query: 601  NLCPECSRKEKYLGMTLLENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG 660
            NLCPECSRKEKYLGMTLLENMT VTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG
Sbjct: 603  NLCPECSRKEKYLGMTLLENMTXVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG 662

Query: 661  ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN 720
            ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN
Sbjct: 663  ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN 722

Query: 721  TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA 780
            TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA
Sbjct: 723  TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA 782

Query: 781  RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE 840
            RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE
Sbjct: 783  RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE 842

Query: 841  AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS 900
            AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS
Sbjct: 843  AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS 902

Query: 901  SIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD 960
            SIVVAVVEEDKFECDN RILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD
Sbjct: 903  SIVVAVVEEDKFECDNRRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD 962

Query: 961  VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVPSGPDEDLT 1020
            VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ DLSIISKSLLEEESMVPSGPDEDLT
Sbjct: 963  VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLT 1022

Query: 1021 PPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLAYSAASIAI 1080
            PPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHD+AYSAASIAI
Sbjct: 1023 PPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAI 1082

Query: 1081 EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE 1140
            EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE
Sbjct: 1083 EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE 1142

Query: 1141 NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM
Sbjct: 1143 NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1147

BLAST of MS000966 vs. NCBI nr
Match: XP_022131333.1 (uncharacterized protein LOC111004588 isoform X2 [Momordica charantia])

HSP 1 Score: 2118.6 bits (5488), Expect = 0.0e+00
Identity = 1137/1176 (96.68%), Positives = 1138/1176 (96.77%), Query Frame = 0

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALRSSPGREL GSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED
Sbjct: 3    MPPSPALRSSPGRELXGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 62

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 120
            LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP
Sbjct: 63   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 122

Query: 121  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 180
            PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS
Sbjct: 123  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 182

Query: 181  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 240
            APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG
Sbjct: 183  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 242

Query: 241  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 300
            ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM
Sbjct: 243  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 302

Query: 301  DLQYKYSRQSMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS 360
            DLQYKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS
Sbjct: 303  DLQYKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS 362

Query: 361  KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMTTN 420
            KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL                      
Sbjct: 363  KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL---------------------- 422

Query: 421  GAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA 480
                     DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA
Sbjct: 423  ---------DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA 482

Query: 481  SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC 540
            SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC
Sbjct: 483  SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC 542

Query: 541  DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENNI 600
            DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEEN I
Sbjct: 543  DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENXI 602

Query: 601  NLCPECSRKEKYLGMTLLENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG 660
            NLCPECSRKEKYLGMTLLENMT VTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG
Sbjct: 603  NLCPECSRKEKYLGMTLLENMTXVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG 662

Query: 661  ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN 720
            ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN
Sbjct: 663  ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN 722

Query: 721  TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA 780
            TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA
Sbjct: 723  TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA 782

Query: 781  RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE 840
            RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE
Sbjct: 783  RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE 842

Query: 841  AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS 900
            AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS
Sbjct: 843  AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS 902

Query: 901  SIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD 960
            SIVVAVVEEDKFECDN RILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD
Sbjct: 903  SIVVAVVEEDKFECDNRRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD 962

Query: 961  VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVPSGPDEDLT 1020
            VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ DLSIISKSLLEEESMVPSGPDEDLT
Sbjct: 963  VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLT 1022

Query: 1021 PPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLAYSAASIAI 1080
            PPVINTEKSYGIL ESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHD+AYSAASIAI
Sbjct: 1023 PPVINTEKSYGIL-ESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAI 1082

Query: 1081 EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE 1140
            EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE
Sbjct: 1083 EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE 1142

Query: 1141 NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM
Sbjct: 1143 NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1146

BLAST of MS000966 vs. NCBI nr
Match: XP_022131334.1 (uncharacterized protein LOC111004588 isoform X3 [Momordica charantia])

HSP 1 Score: 2000.3 bits (5181), Expect = 0.0e+00
Identity = 1074/1113 (96.50%), Positives = 1076/1113 (96.68%), Query Frame = 0

Query: 64   SFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT 123
            S +TKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT
Sbjct: 32   SAATKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT 91

Query: 124  LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH 183
            LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH
Sbjct: 92   LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH 151

Query: 184  SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG 243
            SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG
Sbjct: 152  SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG 211

Query: 244  TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ 303
            TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ
Sbjct: 212  TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ 271

Query: 304  YKYSRQSMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG 363
            YKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG
Sbjct: 272  YKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG 331

Query: 364  NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMTTNGAY 423
            NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL                         
Sbjct: 332  NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL------------------------- 391

Query: 424  ALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIV 483
                  DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIV
Sbjct: 392  ------DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIV 451

Query: 484  LDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPV 543
            LDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPV
Sbjct: 452  LDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPV 511

Query: 544  LTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENNINLC 603
            LTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEEN INLC
Sbjct: 512  LTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENXINLC 571

Query: 604  PECSRKEKYLGMTLLENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESR 663
            PECSRKEKYLGMTLLENMT VTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESR
Sbjct: 572  PECSRKEKYLGMTLLENMTXVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESR 631

Query: 664  ISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTES 723
            ISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTES
Sbjct: 632  ISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTES 691

Query: 724  GYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDS 783
            GYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDS
Sbjct: 692  GYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDS 751

Query: 784  MSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAAS 843
            MSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAAS
Sbjct: 752  MSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAAS 811

Query: 844  SGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIV 903
            SGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIV
Sbjct: 812  SGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIV 871

Query: 904  VAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSA 963
            VAVVEEDKFECDN RILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSA
Sbjct: 872  VAVVEEDKFECDNRRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSA 931

Query: 964  FEDEQHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVPSGPDEDLTPPV 1023
            FEDEQHPNHLMTTISEKETKQIAEVIAPGSQ DLSIISKSLLEEESMVPSGPDEDLTPPV
Sbjct: 932  FEDEQHPNHLMTTISEKETKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLTPPV 991

Query: 1024 INTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLAYSAASIAIEKE 1083
            INTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHD+AYSAASIAIEKE
Sbjct: 992  INTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAIEKE 1051

Query: 1084 NEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDE 1143
            NEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDE
Sbjct: 1052 NEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDE 1111

Query: 1144 NTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            NTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM
Sbjct: 1112 NTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1113

BLAST of MS000966 vs. NCBI nr
Match: XP_031737323.1 (serine/arginine repetitive matrix protein 2 isoform X1 [Cucumis sativus] >KGN62363.1 hypothetical protein Csa_018751 [Cucumis sativus])

HSP 1 Score: 1688.3 bits (4371), Expect = 0.0e+00
Identity = 944/1192 (79.19%), Positives = 1013/1192 (84.98%), Query Frame = 0

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALRSSPGRE RGSNHKRGHSFES + IREKDDDLALFNEMQTRERE FLLQSAED
Sbjct: 3    MPPSPALRSSPGRESRGSNHKRGHSFESAVRIREKDDDLALFNEMQTREREGFLLQSAED 62

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSELL-NVDGEKNDYDWLLTPPDTPLFPSLDNDP 120
            LEDSFSTKLRHFSD+KLGISIPVRGENS+LL NV+ EKNDYDWLLTPPDTPLFPSLD++P
Sbjct: 63   LEDSFSTKLRHFSDLKLGISIPVRGENSDLLNNVEAEKNDYDWLLTPPDTPLFPSLDDEP 122

Query: 121  PPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQL 180
            P V +ASRGRPRSQPISISRSSTMEKSHRSSTSRGS SPNRLSPSPRSA+SVPQ+RGRQL
Sbjct: 123  PSVAIASRGRPRSQPISISRSSTMEKSHRSSTSRGSPSPNRLSPSPRSANSVPQLRGRQL 182

Query: 181  SAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTS 240
            SAPHSSPTPSLRHATPSRRSTTP RRS PPPSTPS SV RSSTPTPRRLSTGSSGT+  S
Sbjct: 183  SAPHSSPTPSLRHATPSRRSTTPTRRSPPPPSTPSTSVPRSSTPTPRRLSTGSSGTAGIS 242

Query: 241  GARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNS 300
            GARGTSPIK+VRGNSASPKIRAWQTNIPGFSSD PPNLRTSL DRPASY RGSSPASRNS
Sbjct: 243  GARGTSPIKSVRGNSASPKIRAWQTNIPGFSSDPPPNLRTSLDDRPASYVRGSSPASRNS 302

Query: 301  MDLQYKYSRQSMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDN 360
             DL +KY RQSMSPTA   ISSSHSHDRDRYSSYSRGS ASSGDDDLDSLQSIP SSLDN
Sbjct: 303  RDLAHKYGRQSMSPTASRSISSSHSHDRDRYSSYSRGSIASSGDDDLDSLQSIPISSLDN 362

Query: 361  SLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYM 420
            SLSKGG +FSNNKAL  SKKHRIVSS S PKRSLDSTIR L                   
Sbjct: 363  SLSKGGISFSNNKALAFSKKHRIVSS-SAPKRSLDSTIRHL------------------- 422

Query: 421  TTNGAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSY 480
                        DRKSPNMFRPLLSSVPSTTFYTGK SSAHRSLISRNSSVTTSSNASS 
Sbjct: 423  ------------DRKSPNMFRPLLSSVPSTTFYTGKASSAHRSLISRNSSVTTSSNASSD 482

Query: 481  NGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPA 540
            +G  I LDTE SD NQDDM NECEK+ YH+ HEEIFAFDKMDIV+E+PI+DIKSLDSGPA
Sbjct: 483  HGTCIALDTEGSDQNQDDMVNECEKIQYHNSHEEIFAFDKMDIVDEDPIHDIKSLDSGPA 542

Query: 541  PGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEE 600
             GCDPV+T DSS++ ++P+ISST DSS  QG  FSE+VCL+D ++C RCGCRY V DTEE
Sbjct: 543  LGCDPVVTGDSSYEAVVPDISSTSDSSHVQGADFSEIVCLEDTVVCSRCGCRYRVTDTEE 602

Query: 601  NNINLCPECSRKEKYLGMTLLENMTSVTESISGY-SIKYEAGKPFNKVESGVISLESSLA 660
            N+ NLCPECSR+EK L + + ENMT+VTES+SG  S+KYE  KPF+KVE  VIS +S+LA
Sbjct: 603  NDANLCPECSREEKCLSLAISENMTAVTESLSGLSSVKYE-DKPFDKVELVVISPDSALA 662

Query: 661  TDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSG 720
             DLGESRIS  +GNVEQDQASYPE+G SY  ENFP+ETP  ESQHSLINH EIGQ  VSG
Sbjct: 663  NDLGESRISMFVGNVEQDQASYPEQGPSY-VENFPAETPSEESQHSLINHLEIGQSAVSG 722

Query: 721  SQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDD 780
            +Q +T SGYQQPL  NDY+ LRFDS EGAGISILLKRSSSSKGP+VQGRTFT STISYDD
Sbjct: 723  NQPDTGSGYQQPLQRNDYQSLRFDSPEGAGISILLKRSSSSKGPVVQGRTFTASTISYDD 782

Query: 781  LSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQ--VSSRKGELESKKGEICV 840
            LSFARDSMSSLRSS+GHSSFSASSSADFSSARQIEAR+QRQ  +SSRKGELE+KKGEI V
Sbjct: 783  LSFARDSMSSLRSSIGHSSFSASSSADFSSARQIEARMQRQLSLSSRKGELENKKGEISV 842

Query: 841  KSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENA 900
            KSH +E ASSG P +AHP+ GFETC+Q+EN+DF VANLEC S QGTT SSQK ELASEN 
Sbjct: 843  KSHCAEIASSGIPASAHPISGFETCKQDENVDFYVANLECSSCQGTTTSSQKAELASENG 902

Query: 901  ESDDTSSIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLE 960
            +SDDTSSI VAVVEEDKFE D CRILDTCTSE SRED SGGRSVSDK+A VT SDCSKLE
Sbjct: 903  KSDDTSSISVAVVEEDKFEYDTCRILDTCTSELSREDSSGGRSVSDKDASVTNSDCSKLE 962

Query: 961  GHNMPDVSAFEDEQH--PNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVP 1020
            GHNM     FEDE+     H M TISE E  QIAEV+A GSQ D+S IS   LEEES+V 
Sbjct: 963  GHNMLG-DVFEDERSEVSTHPMITISETEATQIAEVVASGSQDDISTISMIPLEEESVVL 1022

Query: 1021 SGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLA 1080
            SGPD+DLTP +IN EKS GILEESTVIVDYQG+ KVVRSLTLEEATDTILFCSSIVHDLA
Sbjct: 1023 SGPDQDLTPSIINAEKSDGILEESTVIVDYQGKTKVVRSLTLEEATDTILFCSSIVHDLA 1082

Query: 1081 YSAASIAI--------EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLR 1140
            YSAA+IAI        EKENEVTLE SRP VTILGKSNT+RSDLR RTGGKRVMKSQK R
Sbjct: 1083 YSAATIAIEKEKEKEKEKENEVTLEASRPMVTILGKSNTNRSDLRHRTGGKRVMKSQKPR 1142

Query: 1141 QRHVEMSTKPPVTKTENDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            QR VEMSTKPP+  TENDENTDESTIRNVGLPNQVD+ KP KLESKCNCSIM
Sbjct: 1143 QRRVEMSTKPPIAYTENDENTDESTIRNVGLPNQVDTAKPPKLESKCNCSIM 1159

BLAST of MS000966 vs. NCBI nr
Match: XP_022961987.1 (flocculation protein FLO11 [Cucurbita moschata] >XP_022961988.1 flocculation protein FLO11 [Cucurbita moschata] >XP_022961989.1 flocculation protein FLO11 [Cucurbita moschata])

HSP 1 Score: 1685.2 bits (4363), Expect = 0.0e+00
Identity = 930/1188 (78.28%), Positives = 1006/1188 (84.68%), Query Frame = 0

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALRSSPG E RGSNHKRGHSFESG  IREKDDDLALFNEMQTRER+ FLLQSAED
Sbjct: 1    MPPSPALRSSPGSEPRGSNHKRGHSFESGARIREKDDDLALFNEMQTRERDDFLLQSAED 60

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSE-LLNVDGEKNDYDWLLTPPDTPLFPSLDNDP 120
             EDSFSTKLRHF D+KLGIS+PVRGENS+ L+N + +KNDYDWLLTPPDTPLFPSLD++P
Sbjct: 61   FEDSFSTKLRHFPDLKLGISVPVRGENSDMLINAETDKNDYDWLLTPPDTPLFPSLDDEP 120

Query: 121  PPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQL 180
            PPVT+ASRGRPRSQPISISRSSTMEKSHRSSTSRGS SPNRLSPSPRSA+SVPQ+RGRQL
Sbjct: 121  PPVTIASRGRPRSQPISISRSSTMEKSHRSSTSRGSPSPNRLSPSPRSANSVPQLRGRQL 180

Query: 181  SAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTS 240
            SAPHSSPTPSLRHATPSRRSTTP RRSSPPPS PS SV RSSTPTPRRLSTGSSG +  S
Sbjct: 181  SAPHSSPTPSLRHATPSRRSTTPTRRSSPPPSMPSTSVPRSSTPTPRRLSTGSSGAAVIS 240

Query: 241  GARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNS 300
            G RGTSP+K+VRGNSASPKIRAWQTNIPGFSS+ PPNLRTSLADRPASY RGSSPASRNS
Sbjct: 241  GTRGTSPVKSVRGNSASPKIRAWQTNIPGFSSEPPPNLRTSLADRPASYVRGSSPASRNS 300

Query: 301  MDLQYKYSRQSMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDN 360
             DL +KY RQSMSPTA   I+S HSHDRD YSSYSRGS ASSGDDDLDSLQS+P S+LDN
Sbjct: 301  RDLAHKYGRQSMSPTASRSITSPHSHDRDHYSSYSRGSIASSGDDDLDSLQSMPISTLDN 360

Query: 361  SLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYM 420
            SLSKGG + SNNKAL +SKKHRIVSS+S PKRSLDSTIRQL                   
Sbjct: 361  SLSKGGISLSNNKALALSKKHRIVSSSSAPKRSLDSTIRQL------------------- 420

Query: 421  TTNGAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSY 480
                        DRKSPNMFRPLLSSVPSTTFYTGK SSAHR LISRNSSVTTSSNASS 
Sbjct: 421  ------------DRKSPNMFRPLLSSVPSTTFYTGKASSAHR-LISRNSSVTTSSNASSD 480

Query: 481  NGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDS--G 540
            +G  I LDTE SD NQ+D  NECEK+PYHD HEEIFAFDKMDIV+E+P + IKSLDS  G
Sbjct: 481  HGTCIALDTEGSDHNQNDTTNECEKMPYHDSHEEIFAFDKMDIVDEDPFHVIKSLDSGRG 540

Query: 541  PAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDT 600
            PA GCDPV+T DSS++ +IP+I ST DSS  QG  FSEVVCL+D  +C RCGCRY VID+
Sbjct: 541  PALGCDPVVTGDSSYEAVIPDIISTSDSSHVQGGDFSEVVCLEDTFVCSRCGCRYRVIDS 600

Query: 601  EENNINLCPECSRKEKYLGMTLLENMTSVTESISGY-SIKYEAGKPFNKVESGVISLESS 660
            EEN +N CPECSR+EK +GM +  N TSVTES+SG  S+KYEA KPFN+V+S VIS +SS
Sbjct: 601  EENTLNCCPECSREEKDIGMAISNNTTSVTESLSGLSSVKYEADKPFNRVDSLVISPDSS 660

Query: 661  LATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVV 720
            LATD GESRIS S+GN+EQDQAS+PE+G SY +ENFPSETPV ESQHSL NH E+GQL V
Sbjct: 661  LATDFGESRISMSVGNIEQDQASFPEQGPSYLEENFPSETPVEESQHSLTNHLEMGQLAV 720

Query: 721  SGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISY 780
            +GSQ NTESG QQPL HNDY+ LRFDSSEGAGISILLKRSSSSKGP+VQGRTFT STISY
Sbjct: 721  NGSQPNTESGCQQPLQHNDYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISY 780

Query: 781  DDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICV 840
            DDLSFARDSMSSLRSS+GHSSFSASSSADFSS+RQIE R+QRQ+SSRKG+LE+KK E+ V
Sbjct: 781  DDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEGRMQRQLSSRKGDLENKKCEVSV 840

Query: 841  KSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENA 900
            KSH SE AS+GTP NAHP+  FETC+QEEN+DF VA LECFSSQGTT SS KPELASENA
Sbjct: 841  KSHCSEVASTGTPANAHPISSFETCKQEENVDFYVATLECFSSQGTTMSSHKPELASENA 900

Query: 901  ESDDTSSIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLE 960
            ESDD SSIV A VEEDK ECD CR LD CTS SSRED SGGRSVSDK+A VTT DCS+LE
Sbjct: 901  ESDDASSIVAAAVEEDKLECDKCRRLDNCTSGSSREDTSGGRSVSDKDASVTTFDCSRLE 960

Query: 961  GHNMPDVSAFEDE--QHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVP 1020
            GHN+ D   FEDE  + P H MTTISE E  QIAEVI PGSQ DLSII  S+  EES VP
Sbjct: 961  GHNILDGDVFEDEHTELPTHPMTTISETEAAQIAEVIGPGSQNDLSII-PSIPLEESAVP 1020

Query: 1021 SGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLA 1080
            SGPD+DL P VINTEKS GILE STVIVDYQGR KV RSLTLEEATDTILFCSSIVHDLA
Sbjct: 1021 SGPDQDLAPSVINTEKSDGILERSTVIVDYQGRTKVGRSLTLEEATDTILFCSSIVHDLA 1080

Query: 1081 YSAASIAI----EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHV 1140
            YSAA+IAI    EKENEVTLE SRP VTILGKS  +R DLR RTGGKRVMKSQK RQR V
Sbjct: 1081 YSAATIAIEKEKEKENEVTLEASRPMVTILGKSYPNRGDLRHRTGGKRVMKSQKPRQRRV 1140

Query: 1141 EMSTKPPVTKTENDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            EMSTKPP+ KTENDENTDESTI+NVGLPNQVDS KP KLESKCNCSIM
Sbjct: 1141 EMSTKPPIAKTENDENTDESTIQNVGLPNQVDSTKPPKLESKCNCSIM 1155

BLAST of MS000966 vs. ExPASy TrEMBL
Match: A0A6J1BQQ5 (uncharacterized protein LOC111004588 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111004588 PE=4 SV=1)

HSP 1 Score: 2125.1 bits (5505), Expect = 0.0e+00
Identity = 1138/1176 (96.77%), Positives = 1139/1176 (96.85%), Query Frame = 0

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALRSSPGREL GSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED
Sbjct: 3    MPPSPALRSSPGRELXGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 62

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 120
            LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP
Sbjct: 63   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 122

Query: 121  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 180
            PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS
Sbjct: 123  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 182

Query: 181  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 240
            APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG
Sbjct: 183  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 242

Query: 241  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 300
            ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM
Sbjct: 243  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 302

Query: 301  DLQYKYSRQSMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS 360
            DLQYKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS
Sbjct: 303  DLQYKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS 362

Query: 361  KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMTTN 420
            KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL                      
Sbjct: 363  KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL---------------------- 422

Query: 421  GAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA 480
                     DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA
Sbjct: 423  ---------DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA 482

Query: 481  SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC 540
            SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC
Sbjct: 483  SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC 542

Query: 541  DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENNI 600
            DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEEN I
Sbjct: 543  DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENXI 602

Query: 601  NLCPECSRKEKYLGMTLLENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG 660
            NLCPECSRKEKYLGMTLLENMT VTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG
Sbjct: 603  NLCPECSRKEKYLGMTLLENMTXVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG 662

Query: 661  ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN 720
            ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN
Sbjct: 663  ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN 722

Query: 721  TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA 780
            TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA
Sbjct: 723  TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA 782

Query: 781  RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE 840
            RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE
Sbjct: 783  RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE 842

Query: 841  AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS 900
            AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS
Sbjct: 843  AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS 902

Query: 901  SIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD 960
            SIVVAVVEEDKFECDN RILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD
Sbjct: 903  SIVVAVVEEDKFECDNRRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD 962

Query: 961  VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVPSGPDEDLT 1020
            VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ DLSIISKSLLEEESMVPSGPDEDLT
Sbjct: 963  VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLT 1022

Query: 1021 PPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLAYSAASIAI 1080
            PPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHD+AYSAASIAI
Sbjct: 1023 PPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAI 1082

Query: 1081 EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE 1140
            EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE
Sbjct: 1083 EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE 1142

Query: 1141 NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM
Sbjct: 1143 NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1147
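
The percentages in each HSP header are the reported count divided by the alignment length (for the hit above, 1138/1176 identical positions). The Python sketch below shows one way to read such a header line and recompute those values. The names `parse_hsp_stats` and `percent` are hypothetical helpers written for this illustration, not part of BLAST or of this database's tooling, and the pattern assumes the header layout shown in these reports.

```python
import re

# Matches the summary line printed under each "HSP 1 Score" header.
# "Posi?tives" accepts both the standard spelling and the "Postives"
# variant that some report exports contain (assumption about the input).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(length),
        "positives_length": int(pos_length),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 1138/1176 (96.77%), Positives = 1139/1176 (96.85%), Query Frame = 0"
    stats = parse_hsp_stats(header)
    # Recompute both percentages from the raw counts.
    print(percent(stats["identities"], stats["align_length"]))
    print(percent(stats["positives"], stats["positives_length"]))
```

Comparing the recomputed values with the reported ones is a quick sanity check when these headers are copied by hand or scraped from the page.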

BLAST of MS000966 vs. ExPASy TrEMBL
Match: A0A6J1BT22 (uncharacterized protein LOC111004588 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111004588 PE=4 SV=1)

HSP 1 Score: 2118.6 bits (5488), Expect = 0.0e+00
Identity = 1137/1176 (96.68%), Positives = 1138/1176 (96.77%), Query Frame = 0

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALRSSPGREL GSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED
Sbjct: 3    MPPSPALRSSPGRELXGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 62

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 120
            LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP
Sbjct: 63   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 122

Query: 121  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 180
            PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS
Sbjct: 123  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 182

Query: 181  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 240
            APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG
Sbjct: 183  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 242

Query: 241  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 300
            ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM
Sbjct: 243  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 302

Query: 301  DLQYKYSRQSMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS 360
            DLQYKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS
Sbjct: 303  DLQYKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLS 362

Query: 361  KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMTTN 420
            KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL                      
Sbjct: 363  KGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL---------------------- 422

Query: 421  GAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA 480
                     DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA
Sbjct: 423  ---------DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGA 482

Query: 481  SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC 540
            SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC
Sbjct: 483  SIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGC 542

Query: 541  DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENNI 600
            DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEEN I
Sbjct: 543  DPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENXI 602

Query: 601  NLCPECSRKEKYLGMTLLENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG 660
            NLCPECSRKEKYLGMTLLENMT VTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG
Sbjct: 603  NLCPECSRKEKYLGMTLLENMTXVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLG 662

Query: 661  ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN 720
            ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN
Sbjct: 663  ESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSN 722

Query: 721  TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA 780
            TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA
Sbjct: 723  TESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFA 782

Query: 781  RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE 840
            RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE
Sbjct: 783  RDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISE 842

Query: 841  AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS 900
            AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS
Sbjct: 843  AASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTS 902

Query: 901  SIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD 960
            SIVVAVVEEDKFECDN RILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD
Sbjct: 903  SIVVAVVEEDKFECDNRRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPD 962

Query: 961  VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVPSGPDEDLT 1020
            VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ DLSIISKSLLEEESMVPSGPDEDLT
Sbjct: 963  VSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLT 1022

Query: 1021 PPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLAYSAASIAI 1080
            PPVINTEKSYGIL ESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHD+AYSAASIAI
Sbjct: 1023 PPVINTEKSYGIL-ESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAI 1082

Query: 1081 EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE 1140
            EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE
Sbjct: 1083 EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTE 1142

Query: 1141 NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM
Sbjct: 1143 NDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1146

BLAST of MS000966 vs. ExPASy TrEMBL
Match: A0A6J1BPZ7 (uncharacterized protein LOC111004588 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111004588 PE=4 SV=1)

HSP 1 Score: 2000.3 bits (5181), Expect = 0.0e+00
Identity = 1074/1113 (96.50%), Positives = 1076/1113 (96.68%), Query Frame = 0

Query: 64   SFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT 123
            S +TKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT
Sbjct: 32   SAATKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT 91

Query: 124  LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH 183
            LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH
Sbjct: 92   LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH 151

Query: 184  SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG 243
            SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG
Sbjct: 152  SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG 211

Query: 244  TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ 303
            TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ
Sbjct: 212  TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ 271

Query: 304  YKYSRQSMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG 363
            YKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG
Sbjct: 272  YKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG 331

Query: 364  NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMTTNGAY 423
            NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL                         
Sbjct: 332  NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQL------------------------- 391

Query: 424  ALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIV 483
                  DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIV
Sbjct: 392  ------DRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIV 451

Query: 484  LDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPV 543
            LDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPV
Sbjct: 452  LDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPV 511

Query: 544  LTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENNINLC 603
            LTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEEN INLC
Sbjct: 512  LTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENXINLC 571

Query: 604  PECSRKEKYLGMTLLENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESR 663
            PECSRKEKYLGMTLLENMT VTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESR
Sbjct: 572  PECSRKEKYLGMTLLENMTXVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESR 631

Query: 664  ISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTES 723
            ISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTES
Sbjct: 632  ISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTES 691

Query: 724  GYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDS 783
            GYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDS
Sbjct: 692  GYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDS 751

Query: 784  MSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAAS 843
            MSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAAS
Sbjct: 752  MSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAAS 811

Query: 844  SGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIV 903
            SGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIV
Sbjct: 812  SGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIV 871

Query: 904  VAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSA 963
            VAVVEEDKFECDN RILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSA
Sbjct: 872  VAVVEEDKFECDNRRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSA 931

Query: 964  FEDEQHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVPSGPDEDLTPPV 1023
            FEDEQHPNHLMTTISEKETKQIAEVIAPGSQ DLSIISKSLLEEESMVPSGPDEDLTPPV
Sbjct: 932  FEDEQHPNHLMTTISEKETKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLTPPV 991

Query: 1024 INTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLAYSAASIAIEKE 1083
            INTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHD+AYSAASIAIEKE
Sbjct: 992  INTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAIEKE 1051

Query: 1084 NEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDE 1143
            NEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDE
Sbjct: 1052 NEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDE 1111

Query: 1144 NTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            NTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM
Sbjct: 1112 NTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1113

BLAST of MS000966 vs. ExPASy TrEMBL
Match: A0A0A0LKP5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G350460 PE=4 SV=1)

HSP 1 Score: 1688.3 bits (4371), Expect = 0.0e+00
Identity = 944/1192 (79.19%), Positives = 1013/1192 (84.98%), Query Frame = 0

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALRSSPGRE RGSNHKRGHSFES + IREKDDDLALFNEMQTRERE FLLQSAED
Sbjct: 3    MPPSPALRSSPGRESRGSNHKRGHSFESAVRIREKDDDLALFNEMQTREREGFLLQSAED 62

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSELL-NVDGEKNDYDWLLTPPDTPLFPSLDNDP 120
            LEDSFSTKLRHFSD+KLGISIPVRGENS+LL NV+ EKNDYDWLLTPPDTPLFPSLD++P
Sbjct: 63   LEDSFSTKLRHFSDLKLGISIPVRGENSDLLNNVEAEKNDYDWLLTPPDTPLFPSLDDEP 122

Query: 121  PPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQL 180
            P V +ASRGRPRSQPISISRSSTMEKSHRSSTSRGS SPNRLSPSPRSA+SVPQ+RGRQL
Sbjct: 123  PSVAIASRGRPRSQPISISRSSTMEKSHRSSTSRGSPSPNRLSPSPRSANSVPQLRGRQL 182

Query: 181  SAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTS 240
            SAPHSSPTPSLRHATPSRRSTTP RRS PPPSTPS SV RSSTPTPRRLSTGSSGT+  S
Sbjct: 183  SAPHSSPTPSLRHATPSRRSTTPTRRSPPPPSTPSTSVPRSSTPTPRRLSTGSSGTAGIS 242

Query: 241  GARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNS 300
            GARGTSPIK+VRGNSASPKIRAWQTNIPGFSSD PPNLRTSL DRPASY RGSSPASRNS
Sbjct: 243  GARGTSPIKSVRGNSASPKIRAWQTNIPGFSSDPPPNLRTSLDDRPASYVRGSSPASRNS 302

Query: 301  MDLQYKYSRQSMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDN 360
             DL +KY RQSMSPTA   ISSSHSHDRDRYSSYSRGS ASSGDDDLDSLQSIP SSLDN
Sbjct: 303  RDLAHKYGRQSMSPTASRSISSSHSHDRDRYSSYSRGSIASSGDDDLDSLQSIPISSLDN 362

Query: 361  SLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYM 420
            SLSKGG +FSNNKAL  SKKHRIVSS S PKRSLDSTIR L                   
Sbjct: 363  SLSKGGISFSNNKALAFSKKHRIVSS-SAPKRSLDSTIRHL------------------- 422

Query: 421  TTNGAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSY 480
                        DRKSPNMFRPLLSSVPSTTFYTGK SSAHRSLISRNSSVTTSSNASS 
Sbjct: 423  ------------DRKSPNMFRPLLSSVPSTTFYTGKASSAHRSLISRNSSVTTSSNASSD 482

Query: 481  NGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPA 540
            +G  I LDTE SD NQDDM NECEK+ YH+ HEEIFAFDKMDIV+E+PI+DIKSLDSGPA
Sbjct: 483  HGTCIALDTEGSDQNQDDMVNECEKIQYHNSHEEIFAFDKMDIVDEDPIHDIKSLDSGPA 542

Query: 541  PGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEE 600
             GCDPV+T DSS++ ++P+ISST DSS  QG  FSE+VCL+D ++C RCGCRY V DTEE
Sbjct: 543  LGCDPVVTGDSSYEAVVPDISSTSDSSHVQGADFSEIVCLEDTVVCSRCGCRYRVTDTEE 602

Query: 601  NNINLCPECSRKEKYLGMTLLENMTSVTESISGY-SIKYEAGKPFNKVESGVISLESSLA 660
            N+ NLCPECSR+EK L + + ENMT+VTES+SG  S+KYE  KPF+KVE  VIS +S+LA
Sbjct: 603  NDANLCPECSREEKCLSLAISENMTAVTESLSGLSSVKYE-DKPFDKVELVVISPDSALA 662

Query: 661  TDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSG 720
             DLGESRIS  +GNVEQDQASYPE+G SY  ENFP+ETP  ESQHSLINH EIGQ  VSG
Sbjct: 663  NDLGESRISMFVGNVEQDQASYPEQGPSY-VENFPAETPSEESQHSLINHLEIGQSAVSG 722

Query: 721  SQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDD 780
            +Q +T SGYQQPL  NDY+ LRFDS EGAGISILLKRSSSSKGP+VQGRTFT STISYDD
Sbjct: 723  NQPDTGSGYQQPLQRNDYQSLRFDSPEGAGISILLKRSSSSKGPVVQGRTFTASTISYDD 782

Query: 781  LSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQ--VSSRKGELESKKGEICV 840
            LSFARDSMSSLRSS+GHSSFSASSSADFSSARQIEAR+QRQ  +SSRKGELE+KKGEI V
Sbjct: 783  LSFARDSMSSLRSSIGHSSFSASSSADFSSARQIEARMQRQLSLSSRKGELENKKGEISV 842

Query: 841  KSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENA 900
            KSH +E ASSG P +AHP+ GFETC+Q+EN+DF VANLEC S QGTT SSQK ELASEN 
Sbjct: 843  KSHCAEIASSGIPASAHPISGFETCKQDENVDFYVANLECSSCQGTTTSSQKAELASENG 902

Query: 901  ESDDTSSIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLE 960
            +SDDTSSI VAVVEEDKFE D CRILDTCTSE SRED SGGRSVSDK+A VT SDCSKLE
Sbjct: 903  KSDDTSSISVAVVEEDKFEYDTCRILDTCTSELSREDSSGGRSVSDKDASVTNSDCSKLE 962

Query: 961  GHNMPDVSAFEDEQH--PNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVP 1020
            GHNM     FEDE+     H M TISE E  QIAEV+A GSQ D+S IS   LEEES+V 
Sbjct: 963  GHNMLG-DVFEDERSEVSTHPMITISETEATQIAEVVASGSQDDISTISMIPLEEESVVL 1022

Query: 1021 SGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLA 1080
            SGPD+DLTP +IN EKS GILEESTVIVDYQG+ KVVRSLTLEEATDTILFCSSIVHDLA
Sbjct: 1023 SGPDQDLTPSIINAEKSDGILEESTVIVDYQGKTKVVRSLTLEEATDTILFCSSIVHDLA 1082

Query: 1081 YSAASIAI--------EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLR 1140
            YSAA+IAI        EKENEVTLE SRP VTILGKSNT+RSDLR RTGGKRVMKSQK R
Sbjct: 1083 YSAATIAIEKEKEKEKEKENEVTLEASRPMVTILGKSNTNRSDLRHRTGGKRVMKSQKPR 1142

Query: 1141 QRHVEMSTKPPVTKTENDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            QR VEMSTKPP+  TENDENTDESTIRNVGLPNQVD+ KP KLESKCNCSIM
Sbjct: 1143 QRRVEMSTKPPIAYTENDENTDESTIRNVGLPNQVDTAKPPKLESKCNCSIM 1159

BLAST of MS000966 vs. ExPASy TrEMBL
Match: A0A6J1HBJ8 (flocculation protein FLO11 OS=Cucurbita moschata OX=3662 GN=LOC111462594 PE=4 SV=1)

HSP 1 Score: 1685.2 bits (4363), Expect = 0.0e+00
Identity = 930/1188 (78.28%), Positives = 1006/1188 (84.68%), Query Frame = 0

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALRSSPG E RGSNHKRGHSFESG  IREKDDDLALFNEMQTRER+ FLLQSAED
Sbjct: 1    MPPSPALRSSPGSEPRGSNHKRGHSFESGARIREKDDDLALFNEMQTRERDDFLLQSAED 60

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSE-LLNVDGEKNDYDWLLTPPDTPLFPSLDNDP 120
             EDSFSTKLRHF D+KLGIS+PVRGENS+ L+N + +KNDYDWLLTPPDTPLFPSLD++P
Sbjct: 61   FEDSFSTKLRHFPDLKLGISVPVRGENSDMLINAETDKNDYDWLLTPPDTPLFPSLDDEP 120

Query: 121  PPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQL 180
            PPVT+ASRGRPRSQPISISRSSTMEKSHRSSTSRGS SPNRLSPSPRSA+SVPQ+RGRQL
Sbjct: 121  PPVTIASRGRPRSQPISISRSSTMEKSHRSSTSRGSPSPNRLSPSPRSANSVPQLRGRQL 180

Query: 181  SAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTS 240
            SAPHSSPTPSLRHATPSRRSTTP RRSSPPPS PS SV RSSTPTPRRLSTGSSG +  S
Sbjct: 181  SAPHSSPTPSLRHATPSRRSTTPTRRSSPPPSMPSTSVPRSSTPTPRRLSTGSSGAAVIS 240

Query: 241  GARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNS 300
            G RGTSP+K+VRGNSASPKIRAWQTNIPGFSS+ PPNLRTSLADRPASY RGSSPASRNS
Sbjct: 241  GTRGTSPVKSVRGNSASPKIRAWQTNIPGFSSEPPPNLRTSLADRPASYVRGSSPASRNS 300

Query: 301  MDLQYKYSRQSMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDN 360
             DL +KY RQSMSPTA   I+S HSHDRD YSSYSRGS ASSGDDDLDSLQS+P S+LDN
Sbjct: 301  RDLAHKYGRQSMSPTASRSITSPHSHDRDHYSSYSRGSIASSGDDDLDSLQSMPISTLDN 360

Query: 361  SLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYM 420
            SLSKGG + SNNKAL +SKKHRIVSS+S PKRSLDSTIRQL                   
Sbjct: 361  SLSKGGISLSNNKALALSKKHRIVSSSSAPKRSLDSTIRQL------------------- 420

Query: 421  TTNGAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSY 480
                        DRKSPNMFRPLLSSVPSTTFYTGK SSAHR LISRNSSVTTSSNASS 
Sbjct: 421  ------------DRKSPNMFRPLLSSVPSTTFYTGKASSAHR-LISRNSSVTTSSNASSD 480

Query: 481  NGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDS--G 540
            +G  I LDTE SD NQ+D  NECEK+PYHD HEEIFAFDKMDIV+E+P + IKSLDS  G
Sbjct: 481  HGTCIALDTEGSDHNQNDTTNECEKMPYHDSHEEIFAFDKMDIVDEDPFHVIKSLDSGRG 540

Query: 541  PAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDT 600
            PA GCDPV+T DSS++ +IP+I ST DSS  QG  FSEVVCL+D  +C RCGCRY VID+
Sbjct: 541  PALGCDPVVTGDSSYEAVIPDIISTSDSSHVQGGDFSEVVCLEDTFVCSRCGCRYRVIDS 600

Query: 601  EENNINLCPECSRKEKYLGMTLLENMTSVTESISGY-SIKYEAGKPFNKVESGVISLESS 660
            EEN +N CPECSR+EK +GM +  N TSVTES+SG  S+KYEA KPFN+V+S VIS +SS
Sbjct: 601  EENTLNCCPECSREEKDIGMAISNNTTSVTESLSGLSSVKYEADKPFNRVDSLVISPDSS 660

Query: 661  LATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVV 720
            LATD GESRIS S+GN+EQDQAS+PE+G SY +ENFPSETPV ESQHSL NH E+GQL V
Sbjct: 661  LATDFGESRISMSVGNIEQDQASFPEQGPSYLEENFPSETPVEESQHSLTNHLEMGQLAV 720

Query: 721  SGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISY 780
            +GSQ NTESG QQPL HNDY+ LRFDSSEGAGISILLKRSSSSKGP+VQGRTFT STISY
Sbjct: 721  NGSQPNTESGCQQPLQHNDYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISY 780

Query: 781  DDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICV 840
            DDLSFARDSMSSLRSS+GHSSFSASSSADFSS+RQIE R+QRQ+SSRKG+LE+KK E+ V
Sbjct: 781  DDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEGRMQRQLSSRKGDLENKKCEVSV 840

Query: 841  KSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENA 900
            KSH SE AS+GTP NAHP+  FETC+QEEN+DF VA LECFSSQGTT SS KPELASENA
Sbjct: 841  KSHCSEVASTGTPANAHPISSFETCKQEENVDFYVATLECFSSQGTTMSSHKPELASENA 900

Query: 901  ESDDTSSIVVAVVEEDKFECDNCRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLE 960
            ESDD SSIV A VEEDK ECD CR LD CTS SSRED SGGRSVSDK+A VTT DCS+LE
Sbjct: 901  ESDDASSIVAAAVEEDKLECDKCRRLDNCTSGSSREDTSGGRSVSDKDASVTTFDCSRLE 960

Query: 961  GHNMPDVSAFEDE--QHPNHLMTTISEKETKQIAEVIAPGSQGDLSIISKSLLEEESMVP 1020
            GHN+ D   FEDE  + P H MTTISE E  QIAEVI PGSQ DLSII  S+  EES VP
Sbjct: 961  GHNILDGDVFEDEHTELPTHPMTTISETEAAQIAEVIGPGSQNDLSII-PSIPLEESAVP 1020

Query: 1021 SGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLA 1080
            SGPD+DL P VINTEKS GILE STVIVDYQGR KV RSLTLEEATDTILFCSSIVHDLA
Sbjct: 1021 SGPDQDLAPSVINTEKSDGILERSTVIVDYQGRTKVGRSLTLEEATDTILFCSSIVHDLA 1080

Query: 1081 YSAASIAI----EKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHV 1140
            YSAA+IAI    EKENEVTLE SRP VTILGKS  +R DLR RTGGKRVMKSQK RQR V
Sbjct: 1081 YSAATIAIEKEKEKENEVTLEASRPMVTILGKSYPNRGDLRHRTGGKRVMKSQKPRQRRV 1140

Query: 1141 EMSTKPPVTKTENDENTDESTIRNVGLPNQVDSMKPLKLESKCNCSIM 1177
            EMSTKPP+ KTENDENTDESTI+NVGLPNQVDS KP KLESKCNCSIM
Sbjct: 1141 EMSTKPPIAKTENDENTDESTIQNVGLPNQVDSTKPPKLESKCNCSIM 1155

BLAST of MS000966 vs. TAIR 10
Match: AT1G27850.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G40070.1); Has 9215 Blast hits to 5316 proteins in 473 species: Archae - 6; Bacteria - 773; Metazoa - 3392; Fungi - 1710; Plants - 539; Viruses - 143; Other Eukaryotes - 2652 (source: NCBI BLink). )

HSP 1 Score: 622.5 bits (1604), Expect = 7.1e-178
Identity = 492/1234 (39.87%), Positives = 678/1234 (54.94%), Query Frame = 0

Query: 1    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 60
            MPPSPALR SPGREL G  H+RGHS E G+  R+KDDDLALF+EMQ +ER+SFLLQS++D
Sbjct: 1    MPPSPALRCSPGRELPGKKHRRGHSIEYGILFRDKDDDLALFSEMQDKERDSFLLQSSDD 60

Query: 61   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 120
            LED FSTKL+HFS+     +IPV+GE+S LL  +G+KNDYDWLLTPPDTPLFPSLD+ PP
Sbjct: 61   LEDVFSTKLKHFSE----FTIPVQGESSRLLTAEGDKNDYDWLLTPPDTPLFPSLDDQPP 120

Query: 121  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 180
              ++  RGRP+SQ IS+SRSSTMEKS RS  S+GSASPNRLS SPR A ++ Q+RGR  S
Sbjct: 121  AASVVRRGRPQSQ-ISLSRSSTMEKSRRS--SKGSASPNRLSTSPR-ADNMQQIRGRPSS 180

Query: 181  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 240
            A H SP          RRS TP RR SP P  PS  V+RS TPT RR+STGS+ T  +  
Sbjct: 181  ARHPSP-------ASGRRSGTPVRRISPTPGKPSGPVSRSPTPTSRRMSTGST-TMASPA 240

Query: 241  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 300
             RGTSP+ + RGNS SPKI+ WQ+NIPGFS DAPPNLRTSL DRPASY RGSSPASRN  
Sbjct: 241  VRGTSPVSSSRGNSPSPKIKVWQSNIPGFSLDAPPNLRTSLGDRPASYVRGSSPASRNGR 300

Query: 301  DLQYKYSRQSMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360
            D     SR+S+SP+A   +SSSHSH+RDR+SS S+GS ASSGDDDL SLQSIP    + +
Sbjct: 301  DAVSTRSRKSVSPSASRSVSSSHSHERDRFSSQSKGSVASSGDDDLHSLQSIPVGGSERA 360

Query: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMT 420
            +SK  +   N++    S+  +++S  S P+R  +S +RQ+                    
Sbjct: 361  VSKRASLSPNSRT---SRSSKLLSPGSAPRRPFESALRQME------------------- 420

Query: 421  TNGAYALIFLQDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYN 480
                        +   +MFRPL SS+PST  Y+GK SS++  ++ R+S+ T  SN+SS  
Sbjct: 421  ----------HPKSHHSMFRPLASSLPSTGIYSGKGSSSYHHIMLRHSTATVGSNSSSGQ 480

Query: 481  GASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPIND---------I 540
                + D +  D       +E E + Y D HEE  AF  +++ NE+  ++         +
Sbjct: 481  VTGFMPDAKGMD-PVPVFQSEVENLAYPDKHEESIAFGMVNLSNESSRHESHESSFSDQL 540

Query: 541  KSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCR 600
              +D      C+    E+ SHQ    E SST  S    GN F E V L+ + +C RCG  
Sbjct: 541  GDMDQDYTVECESSANEEVSHQVFDVENSSTHGSLHV-GNEFLEGVALETMEVCGRCGSH 600

Query: 601  YCVIDTEENNINLCPECSRKEKYLGM-TLLENMTSVTESISGYSIKYEAGKPFNKVESGV 660
            YC  +   + IN+CPEC  +  ++   +   N   ++++I   +  Y    P       V
Sbjct: 601  YCATEATRSEINICPECREEHSFVETDSPGTNSPKLSQTIFDENKLYFENIP-------V 660

Query: 661  ISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSE 720
            I +  SL   + E  I E+   +EQ   SY +E   Y          + E    ++N+ +
Sbjct: 661  IDVLDSLPVVMVEEEILETPEKIEQCDNSYEQE--QYHLYESSISRALEEQNVDMLNYKD 720

Query: 721  IGQLVVSGSQSNTESG-------YQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPI 780
                   G+QS+T  G         Q    + + D+   S     + +++KRS S K P+
Sbjct: 721  -------GTQSSTGCGPLSIGTKDTQTQLSDKHHDVNIGSLGRGDVPLVIKRSVSMKSPV 780

Query: 781  VQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSR 840
            +Q    +  T SY+  S++RD   SLRSS    + SASSS D+ S+ +  + I RQ S  
Sbjct: 781  IQANNSSCFTRSYEGFSYSRDRSISLRSST--ETASASSSWDYGSSIRKGSHI-RQRSGS 840

Query: 841  KGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTT 900
              +LE+ + +   KS  + ++SSG  ++    L       E++ +   A + C   +   
Sbjct: 841  TLDLETHRYDTNSKSLSTMSSSSGMSSHTFQAL---NVMPEDSFEMCAAQMTCTLDETHQ 900

Query: 901  NSSQKPELASENAESDDTSSIVVAVVEE----------------------DKFECDNCR- 960
             S  +P    +N E  +T+ +    VE                       D   C+N   
Sbjct: 901  ESHTEP----QNLECKETNVMNADFVESVGLVRISANVLGDLAEHNPVVMDDECCENGND 960

Query: 961  ILDTCTSE-SSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTI 1020
            + +T  S+  +RE  +  RS SD  A   T DC   +   + +    E    P+ L TT 
Sbjct: 961  VANTVISKGETRESPAHIRSTSDLGASPITDDCPFNDHSRLQENDVNET---PHGLSTTT 1020

Query: 1021 SEKETKQIAEVIAPG-------SQGDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSY 1080
            + +   + +E   PG        + + ++ +     E+SMV +  D   +   +N     
Sbjct: 1021 ASEIEPESSEPEIPGLGVHDEIPESERNLNAVDDCSEKSMVHASVDHHSSSAPVNE---- 1080

Query: 1081 GILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDLAYSAASIAIEKENEVTLEG 1140
             IL+ESTV+V+  G  K  RSLTLEEATDTILFCSSIVHDL Y AA+IA++K  +V  E 
Sbjct: 1081 -ILDESTVLVECPG-GKEPRSLTLEEATDTILFCSSIVHDLVYQAATIAMDKAKDVPAEE 1140

Query: 1141 S--RPTVTILGKSNTDRSDLRSRTG---GKRVMKSQKLRQRHVEMSTKPPVTKTENDENT 1177
                PTVT+LGKSN +R+      G    KR  K+ K  ++  E   K  V + ENDEN 
Sbjct: 1141 EMLHPTVTVLGKSNANRNSYGLGGGTKAKKRSSKAAKASRKQTETEEKTEV-QIENDENA 1148

BLAST of MS000966 vs. TAIR 10
Match: AT2G40070.1 (BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 85.1 bits (209), Expect = 4.1e-16
Identity = 162/578 (28.03%), Positives = 243/578 (42.04%), Query Frame = 0

Query: 34  EKDDDLALFNEMQTRERES---FLLQSAEDLEDSFSTK--LRHFSDIKLGISIPVRGENS 93
           EKD++L+LF EM+ RE+E     L  + ++ E    +K       +I  G     +    
Sbjct: 30  EKDEELSLFLEMRRREKEQDNLLLNNNPDEFETPLGSKHGTSPVFNISSGAPPSRKAAPD 89

Query: 94  ELLNVDGEKNDYDWLLTPPDTPLFPSL-------------DNDPPPVTLASR-------- 153
           + LN +G+KNDY+WLLTPP TPLFPSL             D+   P TL SR        
Sbjct: 90  DFLNSEGDKNDYEWLLTPPGTPLFPSLEMESHRTMMSQTGDSKSRPATLTSRLANSSTES 149

Query: 154 ------------------------------GRPRSQPIS-ISRSSTMEKSHRSS-----T 213
                                         G P S+P +   RSST+  + +SS     T
Sbjct: 150 AARNHLTSRQQTSSPGLSSSSGASRRPSSSGGPGSRPATPTGRSSTLTANSKSSRPSTPT 209

Query: 214 SRGSAS-------PNRLSPSPRSASSVPQMRGRQLSAPHSSPTP------------SLRH 273
           SR + S        N  S    +    P  R   LS+   +PT             S+  
Sbjct: 210 SRATVSSATRPSLTNSRSTVSATTKPTPMSRSTSLSSSRLTPTASKPTTSTARSAGSVTR 269

Query: 274 ATPS--------RRSTTPARRSSPPPST--------PSISVTRSSTPTPRRLSTGSSGTS 333
           +TPS         RSTTP  RS+   ST        PS +++RSSTPT R +++ S+ T+
Sbjct: 270 STPSTTTKSAGPSRSTTPLSRSTARSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATT 329

Query: 334 TT----SGARGTSPIKA----------VRGNSASPKIRA--WQ-TNIPGFSSDAPPNLRT 393
           T     S  + +SP  A              +ASP +R+  W+ +++PGFS + PPNLRT
Sbjct: 330 TANPTISQIKPSSPAPAKPMPTPSKNPALSRAASPTVRSRPWKPSDMPGFSLETPPNLRT 389

Query: 394 SLADRPASYTRG--SSPASRNSM-----DLQYKYSRQSMSPT---APISSSHSHDRDRYS 453
           +L +RP S TRG   +P+SR+           +  RQS SP+   AP+ SS S       
Sbjct: 390 TLPERPLSATRGRPGAPSSRSGSVEPGGPPGGRPRRQSCSPSRGRAPMYSSGSSVPAVNR 449

Query: 454 SYSRGS-------KASSGDDDLDSLQSIPTSSLDNSLSKGGNTFSNNKALTISKKHRIVS 477
            YS+ S         +   + + +++ +     D+  S  GN  + + +   +   R +S
Sbjct: 450 GYSKASDNVSPVMMGTKMVERVINMRKLAPPRSDDKGSPHGNLSAKSSSPDSAGFGRTLS 509

BLAST of MS000966 vs. TAIR 10
Match: AT2G40070.2 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); Has 108635 Blast hits to 60786 proteins in 2176 species: Archae - 287; Bacteria - 15142; Metazoa - 39415; Fungi - 26849; Plants - 4416; Viruses - 2864; Other Eukaryotes - 19662 (source: NCBI BLink). )

HSP 1 Score: 72.0 bits (175), Expect = 3.6e-12
Identity = 156/565 (27.61%), Positives = 235/565 (41.59%), Query Frame = 0

Query: 48  RERESFLLQSAEDLEDSFSTKL--RHFS----DIKLGISIPVRGENSELLNVDGEKNDYD 107
           +E+++ LL +     D F T L  +H +    +I  G     +    + LN +G+KNDY+
Sbjct: 6   KEQDNLLLNNN---PDEFETPLGSKHGTSPVFNISSGAPPSRKAAPDDFLNSEGDKNDYE 65

Query: 108 WLLTPPDTPLFPSL-------------DNDPPPVTLASR--------------------- 167
           WLLTPP TPLFPSL             D+   P TL SR                     
Sbjct: 66  WLLTPPGTPLFPSLEMESHRTMMSQTGDSKSRPATLTSRLANSSTESAARNHLTSRQQTS 125

Query: 168 -----------------GRPRSQPIS-ISRSSTMEKSHRSS-----TSRGSAS------- 227
                            G P S+P +   RSST+  + +SS     TSR + S       
Sbjct: 126 SPGLSSSSGASRRPSSSGGPGSRPATPTGRSSTLTANSKSSRPSTPTSRATVSSATRPSL 185

Query: 228 PNRLSPSPRSASSVPQMRGRQLSAPHSSPTP------------SLRHATPS--------R 287
            N  S    +    P  R   LS+   +PT             S+  +TPS         
Sbjct: 186 TNSRSTVSATTKPTPMSRSTSLSSSRLTPTASKPTTSTARSAGSVTRSTPSTTTKSAGPS 245

Query: 288 RSTTPARRSSPPPST--------PSISVTRSSTPTPRRLSTGSSGTSTT----SGARGTS 347
           RSTTP  RS+   ST        PS +++RSSTPT R +++ S+ T+T     S  + +S
Sbjct: 246 RSTTPLSRSTARSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATTTANPTISQIKPSS 305

Query: 348 PIKA----------VRGNSASPKIRA--WQ-TNIPGFSSDAPPNLRTSLADRPASYTRG- 407
           P  A              +ASP +R+  W+ +++PGFS + PPNLRT+L +RP S TRG 
Sbjct: 306 PAPAKPMPTPSKNPALSRAASPTVRSRPWKPSDMPGFSLETPPNLRTTLPERPLSATRGR 365

Query: 408 -SSPASRNSM-----DLQYKYSRQSMSPT---APISSSHSHDRDRYSSYSRGS------- 467
             +P+SR+           +  RQS SP+   AP+ SS S        YS+ S       
Sbjct: 366 PGAPSSRSGSVEPGGPPGGRPRRQSCSPSRGRAPMYSSGSSVPAVNRGYSKASDNVSPVM 425

Query: 468 KASSGDDDLDSLQSIPTSSLDNSLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTI 477
             +   + + +++ +     D+  S  GN  + + +   +   R +S     K+SLD  I
Sbjct: 426 MGTKMVERVINMRKLAPPRSDDKGSPHGNLSAKSSSPDSAGFGRTLS-----KKSLDMAI 485

BLAST of MS000966 vs. TAIR 10
Match: AT3G09000.1 (proline-rich family protein )

HSP 1 Score: 72.0 bits (175), Expect = 3.6e-12
Identity = 158/560 (28.21%), Positives = 236/560 (42.14%), Query Frame = 0

Query: 30  MCIREKDDDLALFNEMQTRERE---SFLLQSAED------LEDSFSTKLRHFSDIKLGIS 89
           M   ++D++L+LF EM+ RE+E     LL  +++      L  + +  L   S+      
Sbjct: 1   MLTHDRDEELSLFLEMRRREKEHRADSLLTGSDNVSINATLTAAAAAALSGVSETASSQR 60

Query: 90  IPVRGENSE-LLNVDGEKNDYDWLLTPPDTPLFPS-------LDNDPP---PVTLASR-- 149
            P+R   +E  L  + EK+DYDWLLTPP TP F           +D P   P  L SR  
Sbjct: 61  YPLRRTAAENFLYSENEKSDYDWLLTPPGTPQFEKESHRSVMNQHDAPNSRPTVLKSRLG 120

Query: 150 -----------GRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSP----------- 209
                       +P++   S++       S  S ++   A+P R S +P           
Sbjct: 121 NCREDIVSGNNNKPQTSSSSVAGLRRPSSSGSSRSTSRPATPTRRSTTPTTSTSRPVTTR 180

Query: 210 --RSASSVPQMRGRQLSAPHSSPTPSLRHATP---SRRSTTPARRSSPPPSTPSIS--VT 269
              S SS P  R    +A  ++ T + R  T    S RS TP  RS+P PS+ S    V+
Sbjct: 181 ASNSRSSTPTSRATLTAARATTSTAAPRTTTTSSGSARSATPT-RSNPRPSSASSKKPVS 240

Query: 270 RSSTPTPR-RLSTGSSGTSTTSGARGTSPIKAV--------RGNSASPKI---RAWQ-TN 329
           R +TPT R    TG S  S+ + +RGTSP   V        RG S SP +   R W+   
Sbjct: 241 RPATPTRRPSTPTGPSIVSSKAPSRGTSPSPTVNSLSKAPSRGTSPSPTLNSSRPWKPPE 300

Query: 330 IPGFSSDAPPNLRTSLADRPASYTRG-----SSPASRN---------SMDLQYKYSRQSM 389
           +PGFS +APPNLRT+LADRP S +RG     S+P SR+         +        RQS 
Sbjct: 301 MPGFSLEAPPNLRTTLADRPVSASRGRPGVASAPGSRSGSIERGGGPTSGGSGNARRQSC 360

Query: 390 SPT---APISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSS-----LDNSLSKGG 449
           SP+   API +++       +     +KAS+G    D+L  +   +     + N    G 
Sbjct: 361 SPSRGRAPIGNTNG----SLTGVRGRAKASNGGSGCDNLSPVAMGNKMVERVVNMRKLGP 420

Query: 450 NTFSNNKALTISKKHRIVSS----TSTPKRSLDSTIRQLVWILFFIMVCFYSFTGVYMTT 500
              + N      K     +S     +  K S+D  IR         M      TG     
Sbjct: 421 PRLTENGGRGSGKSSSAFNSLGYGRNLSKSSIDMAIRH--------MDIRRGMTG----- 480

BLAST of MS000966 vs. TAIR 10
Match: AT3G08670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G51540.1); Has 48380 Blast hits to 29827 proteins in 1356 species: Archae - 46; Bacteria - 5589; Metazoa - 17361; Fungi - 13192; Plants - 2237; Viruses - 905; Other Eukaryotes - 9050 (source: NCBI BLink). )

HSP 1 Score: 69.7 bits (169), Expect = 1.8e-11
Identity = 133/459 (28.98%), Positives = 206/459 (44.88%), Query Frame = 0

Query: 16  RGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAEDLEDSFSTKLRHFSDI 75
           RG+N+   +  ++G   R+ D++L LF+++    R SF L S+++L D  S KL   S  
Sbjct: 23  RGNNNNSNNISQNGFS-RDSDENLDLFSKI----RRSFPLASSDELPD-VSAKLGRLS-- 82

Query: 76  KLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVTLASRGR----PR 135
            +G  I  +G++  L + +G KNDYDWLLTPP TPL     +      +AS  R     +
Sbjct: 83  -VGSKIAPKGKDDLLSSAEGGKNDYDWLLTPPGTPLGNDSHSSLAAPKIASSARASSASK 142

Query: 136 SQPISISRSSTMEKSHR----SSTSRGSASPNRLS--PSPRSASSVPQMRGRQL------ 195
           +  +S+S+S +   S R    SS +R S S ++ S   S RS SS+       +      
Sbjct: 143 ASRLSVSQSESGYHSSRPARSSSVTRPSISTSQYSSFTSGRSPSSILNTSSASVSSYIRP 202

Query: 196 SAPHSSPTPSLRHATPSR-----RSTTPAR-------------------RSSPPPSTPSI 255
           S+P S  + S R +TP+R     RS+TP+R                   R S P S P +
Sbjct: 203 SSPSSRSSSSARPSTPTRTSSASRSSTPSRIRPGSSSSSMDKARPSLSSRPSTPTSRPQL 262

Query: 256 SVT---------RSSTPTPRRLSTGSSGTSTTSG---------ARGTSPIKAVRGNSASP 315
           S +          S   TP R S  S+  S TSG         + G +     R +S  P
Sbjct: 263 SASSPNIIASRPNSRPSTPTRRSPSSTSLSATSGPTISGGRAASNGRTGPSLSRPSSPGP 322

Query: 316 KIRAWQTN---IPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQYKYSRQSMSPT 375
           ++R        +  F  D PPNLRTSL DRP S  R S P   +SM      ++ S  P 
Sbjct: 323 RVRNTPQQPIVLADFPLDTPPNLRTSLPDRPISAGR-SRPVGGSSM------AKASPEPK 382

Query: 376 APISSSHSH---DRDRYS-SYSRGSKASSGD--------------DDLDSLQSIPTSSLD 395
            PI+  +S     R R + +  +G    +G                D+ S +++ TS+  
Sbjct: 383 GPITRRNSSPIVTRGRLTETQGKGRFGGNGQHLTDAPEPRRISNVSDITSRRTVKTSTTV 442
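Each HSP summary line above reports identical and positive positions as a count over the alignment length, followed by the same value as a percentage. The following is a minimal Python sketch of how such a line could be parsed and the percentages re-derived. The names `parse_hsp_stats` and `percent` are hypothetical helpers written for this illustration, not part of any BLAST toolkit, and the regular expression assumes the exact line layout shown on this page.

```python
import re

# Assumed layout: "Identity = a/b (x%), Positives = c/d (y%), ..."
# The pattern also tolerates a "Postives" misspelling of the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


# Summary line of the AT2G40070.1 hit, copied from this page.
line = "Identity = 162/578 (28.03%), Positives = 243/578 (42.04%), Query Frame = 0"
stats = parse_hsp_stats(line)

# Recompute both percentages from the raw counts and compare with the report.
recomputed_identity = percent(stats["identities"], stats["length"])
recomputed_positives = percent(stats["positives"], stats["length"])
print(recomputed_identity, stats["identity_pct"])
print(recomputed_positives, stats["positives_pct"])
```

Recomputing the percentage from the counts is a cheap consistency check when these values are copied into a spreadsheet or another table.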

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022131331.1  0.0e+00   96.77         uncharacterized protein LOC111004588 isoform X1 [Momordica charantia] >XP_022131...
XP_022131333.1  0.0e+00   96.68         uncharacterized protein LOC111004588 isoform X2 [Momordica charantia]
XP_022131334.1  0.0e+00   96.50         uncharacterized protein LOC111004588 isoform X3 [Momordica charantia]
XP_031737323.1  0.0e+00   79.19         serine/arginine repetitive matrix protein 2 isoform X1 [Cucumis sativus] >KGN623...
XP_022961987.1  0.0e+00   78.28         flocculation protein FLO11 [Cucurbita moschata] >XP_022961988.1 flocculation pro...
Match Name      E-value   Identity (%)  Description
A0A6J1BQQ5      0.0e+00   96.77         uncharacterized protein LOC111004588 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1BT22      0.0e+00   96.68         uncharacterized protein LOC111004588 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1BPZ7      0.0e+00   96.50         uncharacterized protein LOC111004588 isoform X3 OS=Momordica charantia OX=3673 G...
A0A0A0LKP5      0.0e+00   79.19         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G350460 PE=4 SV=1
A0A6J1HBJ8      0.0e+00   78.28         flocculation protein FLO11 OS=Cucurbita moschata OX=3662 GN=LOC111462594 PE=4 SV...
Match Name      E-value   Identity (%)  Description
AT1G27850.1     7.1e-178  39.87         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G40070.1     4.1e-16   28.03         BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT...
AT2G40070.2     3.6e-12   27.61         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT3G09000.1     3.6e-12   28.21         proline-rich family protein
AT3G08670.1     1.8e-11   28.98         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description          Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 14..29
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 1126..1157
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 129..244
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 925..957
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 1100..1119
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 257..319
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 332..347
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 1..29
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 98..347
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 878..899
None      No IPR available  PANTHER      PTHR31949:SF3  RUN/FYVE DOMAIN PROTEIN     coord: 1..397; coord: 430..1176
None      No IPR available  PANTHER      PTHR31949      GASTRIC MUCIN-LIKE PROTEIN  coord: 1..397; coord: 430..1176
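
Several of the MOBIDB_LITE disorder coordinates listed above overlap or nest (for example 14..29 lies inside 1..29). As a rough illustration, the Python sketch below merges them into non-overlapping regions and counts the residues covered. The function name `merge_intervals` is a hypothetical helper, and the coordinates are assumed to be 1-based and inclusive, which this page does not state explicitly.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) pairs; coordinates are inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this one ends later.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# MOBIDB_LITE coordinates copied from the InterPro table on this page.
disorder = [
    (14, 29), (1126, 1157), (129, 244), (925, 957), (1100, 1119),
    (257, 319), (332, 347), (1, 29), (98, 347), (878, 899),
]

regions = merge_intervals(disorder)
# Inclusive coordinates, so each region spans end - start + 1 residues.
covered = sum(end - start + 1 for start, end in regions)

for start, end in regions:
    print("%d..%d" % (start, end))
print("residues in predicted disorder:", covered)
```

Under these assumptions the ten listed intervals collapse to six regions. Only strictly overlapping intervals are merged; regions that merely sit next to each other are kept separate.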

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MS000966.1    MS000966.1   mRNA