Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
AGGAAAAAAGAAAAAGAAAGGAAAATAACCAAAGCCCTAGTTTTCCTGGATCCTTCTCTATAAGTTTGCGTGGAGCCGAATCTCCACCTTCCACGCAAATTCTCCATCTCCCAATCTTAATCCTTTGCCGATTGCGAATACTATTTCTCTCGCCATCGTCGATTTGGGTCAATGCCATCGTCGATTTGGGTTTCTCATGTTGTTGTCTTGATTCGATTGTTAGTTTCTCTCGTTACCGTCAGACATACAAGCGCGTATTAGATCTGCGTTTCGGTATTTTTCGCATCTCTGCCTTTAGGATAGCAATCTCTTTGAATCGGCTGAGTAAGATAGATTTATCTGTTATCGAGCATGTTTCTTTCGAGGCAGAGGGGAGTTTTGTGGAAGTTTGAGTTATTGGTTGAGGAGAAGCTGTTCAAATACTGGGATGTCTTTTAGCAATTCAATCGAATTCCTTATCAAGTGATTATGACGATTTGCTGGTCGGTTAGATGTGGAATATGCTTGCGTACATGGGGGGTTTATGCTTCTTTATTTGTGTTCCTTTTAATGTTTTTCATCAATTTTGTATTGTACGTTAGGAATCTGGCATCTTTTGTTCTTTTATGAGCTGGATGGAGAAGTTTTTGCGTGGATGTGAGAATAATTATTAGGGAAATCGGGGAAGTATGAGATTGTGCATTGTTGCTCTGAGATTTAATAACATTATGCCCAAAAGTTCACTGCTGGAGTTTGATATAGAGTTAATTGTGCTGGGTCTCATCTGATCCAATCTCAAGGCATGGATCGTCTAAGCTTTTGTGGTTGAAGTTTGAACACAGGAACTTCATATGCTAAGATGGGCATGCAGAATTTAAGATTTGGGTGCAGGTTTTACGATATCTTCACGATGGAAACGTTCTGCATCGCTAATTTCTGTAGTTTTTGCTTTCAGATTAAATGATTGTTTCCACATAATGTCTGTCATATGCTATCTTGCATATTGAATCTTTTTGAAGATTACTAATGTATTAGTATCCATACGGTGCAGAATTTGAGAGTGACTCCTGGGAAGTTAGTGAGAATTTGGTGTTGGAATAAAGTGATAAAAGACCAAGATACTTTTTTAAGCCTCTGAGCTCTTGGAATAATGATGGCCGTACCAGAAAGTGAAGAGGTGCACTATATTTTTCTTTGCTGTTTTCTGGATTTTGAAGATCGTGCTAGGTTTCTCTTATATGGTTATAATTATTGTCAGTAAATGTTCTATTGGCACTTGGCAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTACTGATTATGGTGCAAGTCTTCCTATCAAGAAAAGGAGATTTCCGGTTGTGCAGTCTCCCCCCTCTCCATCTAAAGATATATCTTCATTTCATCCAGATGGAAATTTAATGAAGATTGAACAGCCATCTCCACCTAAAGATGAGTTGTTTCATATCGATGCAAATTTAACGAGTGAAAGGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGTCACAAGTTCTGGGTTGTCGAACAAGAACCAGGATTGCATTTCTAACGAAAATAAAGGCGAATCTGGAACTGATTCATGTTATGCAGATGTGGTCCAGAGTGATTCTGGAATGCCAGTAGTAAAGTTTCAGGAATCTAGTTTGGGAGGGCATGTTTCTTTGAATGGTTATGTTGAATGTGAAGACAAGTCCTTGGTAACCGAAAAACACACTGTTCATGCATCACCAGAGATCTGTGGGGGGCTGAAGTTATCATCAACTAGCCTCAACTCCGATCCTCATGCTGGTAACAAAGAGGAAGAAATTGATGTAAAAGTTGAAGGAGGAGCTGAAGTACCTGTAGGGTTGAAGGAAGATGTGAAACCAAAATTGGTTCCTGAAAAGAGTGATATGAATTTCCTGAAGCAGAACACTAAGGAACATGTGTTACTGGACTTGTCTTTAAACAAGCAGGAAAGTGGCACCCACTGTGTCAAAGGTAATGCAGGG
TCTGATTATGATGGTTCTCTTTTGCATTCAAACAGGGAAAATTGGGATCTAAATACCTCAATGGATTCGTGGGAGAGTTGTACTAGTGATGCACCTGTAGGGCAGATATCATCCACTCAGACAAATACGGCTGCTGAGACAAATGTGTGCTCATCTGAAATGGTTGAAAGTGACAATCCATGTGTAAAGCAAACCTTTTTAGATGGTGAACATAAAGGAAACTCTATTAATGAATGCGTACCATCAAATGATCATCTTCATTTAAGTCTCAATTTATCTTATCCAAAGCCTATGCTTGAAGAAGATCCTTATCTTTCTGAATATGAATCAGATGGCAATTGGGATATTGCTGAGTCTGTCGATGATGATGATAATAATATAGAAGAAGATTATGAAGATGGTGAGGTTCGGGAAACAATGCCTGAAACTGAAGTAGAGGTCCATATGTGTGAGAAACGAGGAATTGAGACTTTTGATCATGCTGATTGTGATGATAAGAAGATCAATTCAGTTGGATTGCCTAATCGTGAGGGTTTCACTTTAGGCTCTCTAGAGCAGGAAACTGAACCAGAAAATCTGAATGTTAGAAGTGAAGACGATGTTCATACTACAACTATAAGTAAATCTTTTGAACAGGAAAATGAAGGTCGTTGTGTGGAAGAAGTACATGCCGTAGATAATACTAGTATAGAGGATGTAAACAGGCCTGTGAAGGCTGCAGGAAGAAACCAATTGTCTCAATATGATGAAAGGGATAACTTTGAGGACCAGGACACTGCTGATAAAGCCATTGATGGAATTCAGGAATTGATTCCGGCAGTTTCTCAGGGTGAGGTGGAGAGTGCTATAGCAGTAGATATAGTGCAGAATAAGGATTTAATTTTGCCTAGTGTCAAGGAGTCTGTAAGTAGTGATGATGTGAAGGATATTTATAGTGGCACTAAAAATAGCCGGATAATTAATCTTAATCGAGGTTCTGCTGATTCAACCCCTTGTAAGGAAAAATCTGGTTTTGTCAGGTCAGTTTTATCTCGTACTGATAGAGAGTTTGTACCCAGCATGGCACTTGAGGGAGCAAATGTGCAACCTCAAGAAAGGTGATCATCTGTTACTTGATCTCTTTTATCTGTTGTTCTTCTCAAACCTGTTTTAACTGCACTTCTGCCTTCCTTATTTTATCCCTTTTTTATTTTTTATTTTTTATTTTCAGAGATGACGCTTATGGTGATACTACCAAGAAATTTTCAGTAGACAGATCCCAGGATCAATCACAATGGAAGAATTTTAGTCATAGAAGAGGGAGAAGTACTAATAGGTTGGATACCCGACCTGGGGAATGGGATTTTGGTCCCAACTTCTCTCCCGAAACATACACCGACCAACAGATTGATTACCATGTTCCTGGTCTTGATCAAAACCGATATAAAATTATACCAGATGGTCCATTTGGTGGTGCTAACCATCGTGGTAGGCAATTGCCAGATGATGAGGGGCCTTATTTTTTCCATGGACCCTCAAGGAGGAAGTCACCTGGAAGAAGACATGGGCCCGGTGTACATGGTGGCAAAATGGTTAACAGAATTCCTAGAGATTTTAGTCCAAATAGATGCATGGATGAGGGCGGTTCCTTTGATCGACAACATGGTGAAAAGTTCACTAGGAATTTTGCTGATGACACAGAGGATCCATTATATGCTCGACCTCAACTTCCATATGAGGTAGACAGACCTTTCTTTCGGGAAAGAAGGAACTTCTCATTCCAAAGAAAAAGTTTTCCCAGAATCGATTCCAAATCTCCAGTGAGATCCCGAGCTCGTTCTCCTAACCAATGGTTTTCTTCAAAAAGATCTGATAGGTTTTGTGGACGTTCCGACATGATACATCGAAGACCTCCAAGTTATAGGATGGACAGGATAAGATCTCCTGATCAGCCTCCTATACGTGGGCGTATGGCAGGCCGAAGACAAGGATTCCGTTACCTTTCGCCATCTGATGACATG
ATGAGGGACGTGGGTCCTGCTCCTGATCATGGCCCCATAAGGTCTCTTATTCCTAATAGGAATCAGAATGAAAGATTACCACTTAGAAACAGAAGTTTTGATGCTATAGATCCCAGAGGAAGGATTGAGAGCGACGAACTTTTTGATGGTCCTGTACGTTCGGGTCAATTGAGTGGGTATAATGGTGGTGAACATGAGGACGATGAAAGAAGATTTAATGAGAGACATGAACCTGTCCATTCTTTTAAGCATCCATATGATGATTCTGATGGTGAGAGATTTCGAAACAACGGTGAAGATTGTTCTAGGCCTTTTAGATTTTGTGCAGAGAATGACTCAAGAATTTCATGGAAGAGAAGGTAGCTTTTTGGGAGAGAATGAGTTGAGAACTTTTACCAGGCGAGTGGAGATTGCTTTAGACTAGGCCATTCTGTTAGTGTAACTTAGATGGTTCATAGATGATATTTTATGGTGCAAAGGGGGAAAATTTTTATGTTCTCTTTTGTTTTATTTCATAATTGCAATTGTAAAAAATTCCATTCTCCATGCAGAAGAGTATATGCTTCTCAAGTGCTATTGTGACTTCAAAGTGACTAAACAACTATGCAAAAATTGTTCTCTTAATCGTTAAGATCTTTCTTGTCCACTGCGATGATTACTTGCATACAGTGCATGCTTATGCTCTTGATAGGAAATTTCAGCTCTTAGTAGGTAAATTCAGTGGAGTTGTTCTCCCTCTCCAATGCCAATATAAGGTGAAACTAAATAGTTCGAAATACCATCAGTTTGCAATACTTTCTTTTAGTACACGTGCCTTCACTGCTTTACTGTTGAATTTTACAGTTCGTCTAGAGCATGGAGAATCACAAAAGGTATGTCTACTATATTCTTGTCTAAAAGTTTTTATTTGGTTGCATATGATTTATTGATCTCATGCAGATTTATTAGTAAAAAAAGGAGCTCTACTTTAGTTGCACTGTTTAATTTAATTTCTTTTTTCTTATAGATTCTATTTTCAGATGTTCTAAATAGTGATGATGGCGCTTCAAAAGCGATGGTACTGAGTGACGATGACAAACAGCTCTTTGAGGTCCCTTTCTTCCTCTCTAGCTATAAAATCGTATTTTCAGGAAGTTGTACACTAATGCTGTTCATCATGAAATTTTGTTTAACTCTTAGTTGACTTGGCATTATCATGAAGTGGCTGTATGCTCTGGCAGTGTAGTTTGGTTACCTCTGTTTTCAAACCCCAAATCCCTTCTGCAATAGATATCTTTAGTTTATTTTATTGTTTGTTTACCTAATAGTCTAGGTCAGTTTAGTTCTTTATTCTTACCTGCTTGTGTGATGTGGATTCTGAACAATTTTTTTAAACCTTGGACATGTACAAGCTTATCACTGTTAACTACATGTATTATGACATGTATGTGCTAGGAAGAAGTTGAAATCTAGATTTCACATGTGAGATCCTGAGATCCTACGTTGGCTTGAAAGAGAAACAAAACATTCTTTATAAAGGCCGTAGAAATCTTTCTCTGGTAGACACGTTTTAAAATCTTGAATGGAATCCCAGAAGGGAAAACTCGAAAGGTAAATCCCAGAAGGGAAAACTCGAAAGGTAAAGTTTGGAAGAGAAAGTCCAAAGAGCACAATATCTGCTAGCGGTGAGCTTGGACTGTTACAAATGGTATCAGTGCAAGTGACCGAGCGGTATGCCAGCAAGGACGTTGGATCTCAATGGGAGATAGATTATGAGATCCGTGGTTGGAGAGGGGAAGGAAGAATTCTATTTTTTCAACGTTACTTGTATAGATTGTATAGATTTGAGTGCTGTCCAGGCTTTAAAAGATTTGTATTGTATTGGGAGTATGAACTTGGTGATGTTCAGGTAGGCATAAGATAAATAATCTCCGAAGATTTCTTCATAACCATTTGGTCTGTATATATATATTTCCATTTTCAAGCTCCCATTGTTGAACTTATTGGCAGGAGTGATATTT
TTCTAACCCTATATATATATATATATATAATTTTTTAGACACTAACTCTTTGGTTGGATTGTCTAACTCTTTGGTTGGATCGAATCGGACAGGGAACTTCTATAAAGATCTTTCCACTCATTCATCATACTCAAAGTCGAAGTCACGGCCGACATGGGAGGCTCGGAAAAAGAACGAGAATCGGTGTCGTGGAAGGAAACAACCAAGAGGGGCAGATTCCTCACCAAAAGATGCTCCAAAGGTTAGTGAGGTCCAGAAGTGAGTATCTATCCCAATGAAATGGACACTCAATTGGAACCATTACTATTTCAAAACCCTTCAAATTCTTTTGTAAGGTTGTGCTTATAACTTGATTTGCTGTAATGTTGGGTAAGTACAATCGATGCTCGAAAACCCGTCAAGATGCTCGAAAACCCGTCAAACCGTTCAATTGGATATTCTTGTATCTTAATTTCTATCCCACAATTTTGTGTCAAACAAATTTAAATCTTGTACCTAATAAAAGTCTAA
mRNA sequence
AGGAAAAAAGAAAAAGAAAGGAAAATAACCAAAGCCCTAGTTTTCCTGGATCCTTCTCTATAAGTTTGCGTGGAGCCGAATCTCCACCTTCCACGCAAATTCTCCATCTCCCAATCTTAATCCTTTGCCGATTGCGAATACTATTTCTCTCGCCATCGTCGATTTGGGTCAATGCCATCGTCGATTTGGGTTTCTCATGTTGTTGTCTTGATTCGATTGTTAGTTTCTCTCGTTACCGTCAGACATACAAGCGCGTATTAGATCTGCGTTTCGAATTTGAGAGTGACTCCTGGGAAGTTAGTGAGAATTTGGTGTTGGAATAAAGTGATAAAAGACCAAGATACTTTTTTAAGCCTCTGAGCTCTTGGAATAATGATGGCCGTACCAGAAAGTGAAGAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTACTGATTATGGTGCAAGTCTTCCTATCAAGAAAAGGAGATTTCCGGTTGTGCAGTCTCCCCCCTCTCCATCTAAAGATATATCTTCATTTCATCCAGATGGAAATTTAATGAAGATTGAACAGCCATCTCCACCTAAAGATGAGTTGTTTCATATCGATGCAAATTTAACGAGTGAAAGGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGTCACAAGTTCTGGGTTGTCGAACAAGAACCAGGATTGCATTTCTAACGAAAATAAAGGCGAATCTGGAACTGATTCATGTTATGCAGATGTGGTCCAGAGTGATTCTGGAATGCCAGTAGTAAAGTTTCAGGAATCTAGTTTGGGAGGGCATGTTTCTTTGAATGGTTATGTTGAATGTGAAGACAAGTCCTTGGTAACCGAAAAACACACTGTTCATGCATCACCAGAGATCTGTGGGGGGCTGAAGTTATCATCAACTAGCCTCAACTCCGATCCTCATGCTGGTAACAAAGAGGAAGAAATTGATGTAAAAGTTGAAGGAGGAGCTGAAGTACCTGTAGGGTTGAAGGAAGATGTGAAACCAAAATTGGTTCCTGAAAAGAGTGATATGAATTTCCTGAAGCAGAACACTAAGGAACATGTGTTACTGGACTTGTCTTTAAACAAGCAGGAAAGTGGCACCCACTGTGTCAAAGGTAATGCAGGGTCTGATTATGATGGTTCTCTTTTGCATTCAAACAGGGAAAATTGGGATCTAAATACCTCAATGGATTCGTGGGAGAGTTGTACTAGTGATGCACCTGTAGGGCAGATATCATCCACTCAGACAAATACGGCTGCTGAGACAAATGTGTGCTCATCTGAAATGGTTGAAAGTGACAATCCATGTGTAAAGCAAACCTTTTTAGATGGTGAACATAAAGGAAACTCTATTAATGAATGCGTACCATCAAATGATCATCTTCATTTAAGTCTCAATTTATCTTATCCAAAGCCTATGCTTGAAGAAGATCCTTATCTTTCTGAATATGAATCAGATGGCAATTGGGATATTGCTGAGTCTGTCGATGATGATGATAATAATATAGAAGAAGATTATGAAGATGGTGAGGTTCGGGAAACAATGCCTGAAACTGAAGTAGAGGTCCATATGTGTGAGAAACGAGGAATTGAGACTTTTGATCATGCTGATTGTGATGATAAGAAGATCAATTCAGTTGGATTGCCTAATCGTGAGGGTTTCACTTTAGGCTCTCTAGAGCAGGAAACTGAACCAGAAAATCTGAATGTTAGAAGTGAAGACGATGTTCATACTACAACTATAAGTAAATCTTTTGAACAGGAAAATGAAGGTCGTTGTGTGGAAGAAGTACATGCCGTAGATAATACTAGTATAGAGGATGTAAACAGGCCTGTGAAGGCTGCAGGAAGAAACCAATTGTCTCAATATGATGAAAGGGATAACTTTGAGGACCAGGACACTGCTGATAAAGCCATTGATGGAATTCAGGAATTGATTCCGGCAGTTTCTCAGGGTGAGGTGGAGAGTGCTATAGCAGTAGATATAGT
GCAGAATAAGGATTTAATTTTGCCTAGTGTCAAGGAGTCTGTAAGTAGTGATGATGTGAAGGATATTTATAGTGGCACTAAAAATAGCCGGATAATTAATCTTAATCGAGGTTCTGCTGATTCAACCCCTTGTAAGGAAAAATCTGGTTTTGTCAGGTCAGTTTTATCTCGTACTGATAGAGAGTTTGTACCCAGCATGGCACTTGAGGGAGCAAATGTGCAACCTCAAGAAAGAGATGACGCTTATGGTGATACTACCAAGAAATTTTCAGTAGACAGATCCCAGGATCAATCACAATGGAAGAATTTTAGTCATAGAAGAGGGAGAAGTACTAATAGGTTGGATACCCGACCTGGGGAATGGGATTTTGGTCCCAACTTCTCTCCCGAAACATACACCGACCAACAGATTGATTACCATGTTCCTGGTCTTGATCAAAACCGATATAAAATTATACCAGATGGTCCATTTGGTGGTGCTAACCATCGTGGTAGGCAATTGCCAGATGATGAGGGGCCTTATTTTTTCCATGGACCCTCAAGGAGGAAGTCACCTGGAAGAAGACATGGGCCCGGTGTACATGGTGGCAAAATGGTTAACAGAATTCCTAGAGATTTTAGTCCAAATAGATGCATGGATGAGGGCGGTTCCTTTGATCGACAACATGGTGAAAAGTTCACTAGGAATTTTGCTGATGACACAGAGGATCCATTATATGCTCGACCTCAACTTCCATATGAGGTAGACAGACCTTTCTTTCGGGAAAGAAGGAACTTCTCATTCCAAAGAAAAAGTTTTCCCAGAATCGATTCCAAATCTCCAGTGAGATCCCGAGCTCGTTCTCCTAACCAATGGTTTTCTTCAAAAAGATCTGATAGGTTTTGTGGACGTTCCGACATGATACATCGAAGACCTCCAAGTTATAGGATGGACAGGATAAGATCTCCTGATCAGCCTCCTATACGTGGGCGTATGGCAGGCCGAAGACAAGGATTCCGTTACCTTTCGCCATCTGATGACATGATGAGGGACGTGGGTCCTGCTCCTGATCATGGCCCCATAAGGTCTCTTATTCCTAATAGGAATCAGAATGAAAGATTACCACTTAGAAACAGAAGTTTTGATGCTATAGATCCCAGAGGAAGGATTGAGAGCGACGAACTTTTTGATGGTCCTGTACGTTCGGGTCAATTGAGTGGGTATAATGGTGGTGAACATGAGGACGATGAAAGAAGATTTAATGAGAGACATGAACCTGTCCATTCTTTTAAGCATCCATATGATGATTCTGATGGTGAGAGATTTCGAAACAACGGTGAAGATTGTTCTAGGCCTTTTAGATTTTGTGCAGAGAATGACTCAAGAATTTCATGGAAGAGAAGGTAGCTTTTTGGGAGAGAATGAGTTGAGAACTTTTACCAGGCGAGTGGAGATTGCTTTAGACTAGGCCATTCTGTTAGTGTAACTTAGATGGTTCATAGATGATATTTTATGGTGCAAAGGGGGAAAATTTTTATGTTCTCTTTTGTTTTATTTCATAATTGCAATTGTAAAAAATTCCATTCTCCATGCAGAAGAGTATATGCTTCTCAAGTGCTATTGTGACTTCAAAGTGACTAAACAACTATGCAAAAATTGTTCTCTTAATCGTTAAGATCTTTCTTGTCCACTGCGATGATTACTTGCATACAGTGCATGCTTATGCTCTTGATAGGAAATTTCAGCTCTTAGTAGGTAAATTCAGTGGAGTTGTTCTCCCTCTCCAATGCCAATATAAGGTGAAACTAAATAGTTCGAAATACCATCAGTTTGCAATACTTTCTTTTAGTACACGTGCCTTCACTGCTTTACTGTTGAATTTTACAGTTCGTCTAGAGCATGGAGAATCACAAAAGATTCTATTTTCAGATGTTCTAAATAGTGATGATGGCGCTTCAAAAGCGATGGTACTGAGTGACGATGACAAACAGCTCTTTGAGGGAACTTCTATAAAGATCTT
TCCACTCATTCATCATACTCAAAGTCGAAGTCACGGCCGACATGGGAGGCTCGGAAAAAGAACGAGAATCGGTGTCGTGGAAGGAAACAACCAAGAGGGGCAGATTCCTCACCAAAAGATGCTCCAAAGGTTAGTGAGGTCCAGAAGTGAGTATCTATCCCAATGAAATGGACACTCAATTGGAACCATTACTATTTCAAAACCCTTCAAATTCTTTTGTAAGGTTGTGCTTATAACTTGATTTGCTGTAATGTTGGGTAAGTACAATCGATGCTCGAAAACCCGTCAAGATGCTCGAAAACCCGTCAAACCGTTCAATTGGATATTCTTGTATCTTAATTTCTATCCCACAATTTTGTGTCAAACAAATTTAAATCTTGTACCTAATAAAAGTCTAA
Coding sequence (CDS)
ATGATGGCCGTACCAGAAAGTGAAGAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTACTGATTATGGTGCAAGTCTTCCTATCAAGAAAAGGAGATTTCCGGTTGTGCAGTCTCCCCCCTCTCCATCTAAAGATATATCTTCATTTCATCCAGATGGAAATTTAATGAAGATTGAACAGCCATCTCCACCTAAAGATGAGTTGTTTCATATCGATGCAAATTTAACGAGTGAAAGGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGTCACAAGTTCTGGGTTGTCGAACAAGAACCAGGATTGCATTTCTAACGAAAATAAAGGCGAATCTGGAACTGATTCATGTTATGCAGATGTGGTCCAGAGTGATTCTGGAATGCCAGTAGTAAAGTTTCAGGAATCTAGTTTGGGAGGGCATGTTTCTTTGAATGGTTATGTTGAATGTGAAGACAAGTCCTTGGTAACCGAAAAACACACTGTTCATGCATCACCAGAGATCTGTGGGGGGCTGAAGTTATCATCAACTAGCCTCAACTCCGATCCTCATGCTGGTAACAAAGAGGAAGAAATTGATGTAAAAGTTGAAGGAGGAGCTGAAGTACCTGTAGGGTTGAAGGAAGATGTGAAACCAAAATTGGTTCCTGAAAAGAGTGATATGAATTTCCTGAAGCAGAACACTAAGGAACATGTGTTACTGGACTTGTCTTTAAACAAGCAGGAAAGTGGCACCCACTGTGTCAAAGGTAATGCAGGGTCTGATTATGATGGTTCTCTTTTGCATTCAAACAGGGAAAATTGGGATCTAAATACCTCAATGGATTCGTGGGAGAGTTGTACTAGTGATGCACCTGTAGGGCAGATATCATCCACTCAGACAAATACGGCTGCTGAGACAAATGTGTGCTCATCTGAAATGGTTGAAAGTGACAATCCATGTGTAAAGCAAACCTTTTTAGATGGTGAACATAAAGGAAACTCTATTAATGAATGCGTACCATCAAATGATCATCTTCATTTAAGTCTCAATTTATCTTATCCAAAGCCTATGCTTGAAGAAGATCCTTATCTTTCTGAATATGAATCAGATGGCAATTGGGATATTGCTGAGTCTGTCGATGATGATGATAATAATATAGAAGAAGATTATGAAGATGGTGAGGTTCGGGAAACAATGCCTGAAACTGAAGTAGAGGTCCATATGTGTGAGAAACGAGGAATTGAGACTTTTGATCATGCTGATTGTGATGATAAGAAGATCAATTCAGTTGGATTGCCTAATCGTGAGGGTTTCACTTTAGGCTCTCTAGAGCAGGAAACTGAACCAGAAAATCTGAATGTTAGAAGTGAAGACGATGTTCATACTACAACTATAAGTAAATCTTTTGAACAGGAAAATGAAGGTCGTTGTGTGGAAGAAGTACATGCCGTAGATAATACTAGTATAGAGGATGTAAACAGGCCTGTGAAGGCTGCAGGAAGAAACCAATTGTCTCAATATGATGAAAGGGATAACTTTGAGGACCAGGACACTGCTGATAAAGCCATTGATGGAATTCAGGAATTGATTCCGGCAGTTTCTCAGGGTGAGGTGGAGAGTGCTATAGCAGTAGATATAGTGCAGAATAAGGATTTAATTTTGCCTAGTGTCAAGGAGTCTGTAAGTAGTGATGATGTGAAGGATATTTATAGTGGCACTAAAAATAGCCGGATAATTAATCTTAATCGAGGTTCTGCTGATTCAACCCCTTGTAAGGAAAAATCTGGTTTTGTCAGGTCAGTTTTATCTCGTACTGATAGAGAGTTTGTACCCAGCATGGCACTTGAGGGAGCAAATGTGCAACCTCAAGAAAGAGATGACGCTTATGGTGATACTACCAAGAAATTTTCAGTAGACAGATCCCAGGATCAATCACAATGGAAGAATTTTAGTCATAGAAGAGGGAGAAGTACTAATAGGTTGGATACCCGACCTGGGGAATGGGATTTTGG
TCCCAACTTCTCTCCCGAAACATACACCGACCAACAGATTGATTACCATGTTCCTGGTCTTGATCAAAACCGATATAAAATTATACCAGATGGTCCATTTGGTGGTGCTAACCATCGTGGTAGGCAATTGCCAGATGATGAGGGGCCTTATTTTTTCCATGGACCCTCAAGGAGGAAGTCACCTGGAAGAAGACATGGGCCCGGTGTACATGGTGGCAAAATGGTTAACAGAATTCCTAGAGATTTTAGTCCAAATAGATGCATGGATGAGGGCGGTTCCTTTGATCGACAACATGGTGAAAAGTTCACTAGGAATTTTGCTGATGACACAGAGGATCCATTATATGCTCGACCTCAACTTCCATATGAGGTAGACAGACCTTTCTTTCGGGAAAGAAGGAACTTCTCATTCCAAAGAAAAAGTTTTCCCAGAATCGATTCCAAATCTCCAGTGAGATCCCGAGCTCGTTCTCCTAACCAATGGTTTTCTTCAAAAAGATCTGATAGGTTTTGTGGACGTTCCGACATGATACATCGAAGACCTCCAAGTTATAGGATGGACAGGATAAGATCTCCTGATCAGCCTCCTATACGTGGGCGTATGGCAGGCCGAAGACAAGGATTCCGTTACCTTTCGCCATCTGATGACATGATGAGGGACGTGGGTCCTGCTCCTGATCATGGCCCCATAAGGTCTCTTATTCCTAATAGGAATCAGAATGAAAGATTACCACTTAGAAACAGAAGTTTTGATGCTATAGATCCCAGAGGAAGGATTGAGAGCGACGAACTTTTTGATGGTCCTGTACGTTCGGGTCAATTGAGTGGGTATAATGGTGGTGAACATGAGGACGATGAAAGAAGATTTAATGAGAGACATGAACCTGTCCATTCTTTTAAGCATCCATATGATGATTCTGATGGTGAGAGATTTCGAAACAACGGTGAAGATTGTTCTAGGCCTTTTAGATTTTGTGCAGAGAATGACTCAAGAATTTCATGGAAGAGAAGGTAG
Protein sequence
MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQPSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCYADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTSLNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSLNKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAAETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYLSEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDDKKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAVDNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAVDIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSRTDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTRPGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFHGPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDPLYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGRSDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSLIPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERHEPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR
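The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the variable `cds_start` are illustrative and not part of the database record. Only the first 48 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 48 nucleotides of the CDS shown above (assumed to be in frame 0).
cds_start = "ATGATGGCCGTACCAGAAAGTGAAGAGGTTGGTTTTAAGCGCATTGGG"
print(translate(cds_start))
```

Applied to the full CDS, the same routine would be expected to stop at the terminal TAG codon and yield the complete polypeptide.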
Homology
BLAST of Cp4.1LG01g24100 vs. NCBI nr
Match:
XP_023535366.1 (uncharacterized protein LOC111796821 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2015 bits (5221), Expect = 0.0
Identity = 1004/1004 (100.00%), Positives = 1004/1004 (100.00%), Query Frame = 0
Query: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ
Sbjct: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
Query: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY 120
PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY
Sbjct: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY 120
Query: 121 ADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
ADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS
Sbjct: 121 ADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
Query: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSL 240
LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSL
Sbjct: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSL 240
Query: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA
Sbjct: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
Query: 301 ETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYL 360
ETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYL
Sbjct: 301 ETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYL 360
Query: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD
Sbjct: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
Query: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV 480
KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV
Sbjct: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV 480
Query: 481 DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV 540
DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV
Sbjct: 481 DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV 540
Query: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR
Sbjct: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
Query: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR
Sbjct: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
Query: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH
Sbjct: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
Query: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP 780
GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP
Sbjct: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP 780
Query: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR
Sbjct: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
Query: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL
Sbjct: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
Query: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH
Sbjct: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
Query: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR
Sbjct: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
BLAST of Cp4.1LG01g24100 vs. NCBI nr
Match:
XP_022958498.1 (uncharacterized protein LOC111459702 [Cucurbita moschata])
HSP 1 Score: 1978 bits (5125), Expect = 0.0
Identity = 985/1004 (98.11%), Positives = 996/1004 (99.20%), Query Frame = 0
Query: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ
Sbjct: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
Query: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY 120
PSPPKDELFHI+ANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDC+SNENKGESGTDSCY
Sbjct: 61 PSPPKDELFHINANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCVSNENKGESGTDSCY 120
Query: 121 ADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
DVVQSDSGMPVVKFQESSLG HVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS
Sbjct: 121 VDVVQSDSGMPVVKFQESSLGRHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
Query: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSL 240
LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNT+EHVLLDLSL
Sbjct: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTEEHVLLDLSL 240
Query: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA
Sbjct: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
Query: 301 ETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYL 360
ETNVCSSEMVESDNPCVKQ+FLDGEHKGNSINEC+P+NDHLHLSLNLSYPKPMLEEDPYL
Sbjct: 301 ETNVCSSEMVESDNPCVKQSFLDGEHKGNSINECIPANDHLHLSLNLSYPKPMLEEDPYL 360
Query: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEV+VHMCEKRGIETFDHADCDD
Sbjct: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVQVHMCEKRGIETFDHADCDD 420
Query: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV 480
KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKS EQENE RCVEEVHAV
Sbjct: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSSEQENEDRCVEEVHAV 480
Query: 481 DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV 540
+ TSIEDVNRPVKAAGRNQLSQYDERD+FE QDTADKAIDGIQ+LIPAVSQGEVESAIAV
Sbjct: 481 EYTSIEDVNRPVKAAGRNQLSQYDERDDFEGQDTADKAIDGIQQLIPAVSQGEVESAIAV 540
Query: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR
Sbjct: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
Query: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR
Sbjct: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
Query: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH
Sbjct: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
Query: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP 780
GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRC+DEGGSFDRQHGEKFTRNFADDTEDP
Sbjct: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCLDEGGSFDRQHGEKFTRNFADDTEDP 780
Query: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR
Sbjct: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
Query: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
SDMI RRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHG IRSL
Sbjct: 841 SDMIPRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGSIRSL 900
Query: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH
Sbjct: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
Query: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR
Sbjct: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
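The identity and positives percentages in each HSP summary are the match counts divided by the alignment length. The sketch below recomputes them from one summary line; the regular expression and the helper name `parse_stats` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 985/1004 (98.11%)" in an HSP summary line.
HSP_STATS = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_stats(line):
    """Return {label: (matches, aligned_length, recomputed_percent)} for one summary line."""
    stats = {}
    for label, hits, length, _reported in HSP_STATS.findall(line):
        hits, length = int(hits), int(length)
        # Recompute the percentage from the counts rather than trusting the reported value.
        stats[label] = (hits, length, round(100 * hits / length, 2))
    return stats


line = "Identity = 985/1004 (98.11%), Positives = 996/1004 (99.20%), Query Frame = 0"
stats = parse_stats(line)
print(stats)
```

The "Query Frame = 0" field is skipped because it has no count/length pair.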
BLAST of Cp4.1LG01g24100 vs. NCBI nr
Match:
KAG6602352.1 (hypothetical protein SDJN03_07585, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1975 bits (5117), Expect = 0.0
Identity = 985/1004 (98.11%), Positives = 996/1004 (99.20%), Query Frame = 0
Query: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ
Sbjct: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
Query: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY 120
PSPPKDELFHI+ANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY
Sbjct: 61 PSPPKDELFHINANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY 120
Query: 121 ADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
DVVQSDSGMPVVKFQESSLG HVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS
Sbjct: 121 VDVVQSDSGMPVVKFQESSLGRHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
Query: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSL 240
LNSDPHAGNKEEEIDVKV GGAEVPVGLKEDVKPKLVPEKSDMNFLKQNT+EHVLLDLSL
Sbjct: 181 LNSDPHAGNKEEEIDVKV-GGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTEEHVLLDLSL 240
Query: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA
Sbjct: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
Query: 301 ETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYL 360
ETNVCSSEMVESDNPCVKQ+FLDGEHKGNSINEC+P+NDHLHLSLNLSYPKPMLEEDPYL
Sbjct: 301 ETNVCSSEMVESDNPCVKQSFLDGEHKGNSINECIPANDHLHLSLNLSYPKPMLEEDPYL 360
Query: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEV+VHMCEKRGIETFDHADCDD
Sbjct: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVQVHMCEKRGIETFDHADCDD 420
Query: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV 480
KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKS EQENE RCVEEVHAV
Sbjct: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSSEQENEDRCVEEVHAV 480
Query: 481 DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV 540
+NTSIEDVNRPVKAAGRNQLSQYDERD+FE QDTADKAIDGIQ+LIPAVSQGEVESAIAV
Sbjct: 481 ENTSIEDVNRPVKAAGRNQLSQYDERDDFEGQDTADKAIDGIQQLIPAVSQGEVESAIAV 540
Query: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR
Sbjct: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
Query: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR
Sbjct: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
Query: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH
Sbjct: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
Query: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP 780
GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRC+DEGGSFDRQHGEKFTRNFADDTEDP
Sbjct: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCLDEGGSFDRQHGEKFTRNFADDTEDP 780
Query: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
LYARPQLPYE+DRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR
Sbjct: 781 LYARPQLPYEIDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
Query: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
SDMI RRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHG IRSL
Sbjct: 841 SDMIPRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGSIRSL 900
Query: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH
Sbjct: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
Query: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR
Sbjct: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1003
BLAST of Cp4.1LG01g24100 vs. NCBI nr
Match:
KAG7033032.1 (hypothetical protein SDJN02_07085, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1962 bits (5083), Expect = 0.0
Identity = 976/996 (97.99%), Positives = 988/996 (99.20%), Query Frame = 0
Query: 9 EVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQPSPPKDEL 68
+VGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQPSPPKDEL
Sbjct: 77 QVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQPSPPKDEL 136
Query: 69 FHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCYADVVQSDS 128
FHI+ANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDC+SNENKGESGTDSCY DVVQSDS
Sbjct: 137 FHINANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCVSNENKGESGTDSCYVDVVQSDS 196
Query: 129 GMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTSLNSDPHAG 188
GMPVVKFQESSLG HVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTSLNSDPHAG
Sbjct: 197 GMPVVKFQESSLGRHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTSLNSDPHAG 256
Query: 189 NKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSLNKQESGTH 248
NKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNT+EHVLLDLSLNKQESGTH
Sbjct: 257 NKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTEEHVLLDLSLNKQESGTH 316
Query: 249 CVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAAETNVCSSE 308
CVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAAETNVCSSE
Sbjct: 317 CVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAAETNVCSSE 376
Query: 309 MVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYLSEYESDGN 368
MVESDNPCVKQ+FLDGEHKGNSINEC+P+NDHLHLSLNLSYPKPMLEEDPYLSEYESDGN
Sbjct: 377 MVESDNPCVKQSFLDGEHKGNSINECIPANDHLHLSLNLSYPKPMLEEDPYLSEYESDGN 436
Query: 369 WDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDDKKINSVGL 428
WDIAESVDDDDNNIEEDYEDGEVRETMPETEV+VHMCEKRGIETFDHADCDDKKINSVGL
Sbjct: 437 WDIAESVDDDDNNIEEDYEDGEVRETMPETEVQVHMCEKRGIETFDHADCDDKKINSVGL 496
Query: 429 PNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAVDNTSIEDV 488
PNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKS EQENE RCVEEVHAV+ TSIEDV
Sbjct: 497 PNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSSEQENEDRCVEEVHAVEYTSIEDV 556
Query: 489 NRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAVDIVQNKDL 548
NRPVKAAGRNQLSQYDERD+FE QDTADKAIDGIQ+LIPAVSQGEVESAIAVDIVQNKDL
Sbjct: 557 NRPVKAAGRNQLSQYDERDDFEGQDTADKAIDGIQQLIPAVSQGEVESAIAVDIVQNKDL 616
Query: 549 ILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSRTDREFVPS 608
ILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSRTDREFVPS
Sbjct: 617 ILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSRTDREFVPS 676
Query: 609 MALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTRPGEWDFGP 668
MALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTRPGEWDFGP
Sbjct: 677 MALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTRPGEWDFGP 736
Query: 669 NFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFHGPSRRKSP 728
NFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFHGPSRRKSP
Sbjct: 737 NFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFHGPSRRKSP 796
Query: 729 GRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDPLYARPQLP 788
GRRHGPGVHGGKMVNRIPRDFSPNRC+DEGGSFDRQHGEKFTRNFADDTEDPLYARPQLP
Sbjct: 797 GRRHGPGVHGGKMVNRIPRDFSPNRCLDEGGSFDRQHGEKFTRNFADDTEDPLYARPQLP 856
Query: 789 YEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGRSDMIHRRP 848
YEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGRSDMI RRP
Sbjct: 857 YEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGRSDMIPRRP 916
Query: 849 PSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSLIPNRNQNE 908
PSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHG IRSLIPNRNQNE
Sbjct: 917 PSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGSIRSLIPNRNQNE 976
Query: 909 RLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERHEPVHSFKH 968
RLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERHEPVHSFKH
Sbjct: 977 RLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERHEPVHSFKH 1036
Query: 969 PYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
PYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR
Sbjct: 1037 PYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1072
BLAST of Cp4.1LG01g24100 vs. NCBI nr
Match:
XP_022990227.1 (uncharacterized protein LOC111487178 isoform X1 [Cucurbita maxima] >XP_022990228.1 uncharacterized protein LOC111487178 isoform X1 [Cucurbita maxima] >XP_022990229.1 uncharacterized protein LOC111487178 isoform X2 [Cucurbita maxima] >XP_022990230.1 uncharacterized protein LOC111487178 isoform X2 [Cucurbita maxima] >XP_022990231.1 uncharacterized protein LOC111487178 isoform X2 [Cucurbita maxima] >XP_022990232.1 uncharacterized protein LOC111487178 isoform X2 [Cucurbita maxima] >XP_022990233.1 uncharacterized protein LOC111487178 isoform X2 [Cucurbita maxima] >XP_022990234.1 uncharacterized protein LOC111487178 isoform X2 [Cucurbita maxima] >XP_022990235.1 uncharacterized protein LOC111487178 isoform X2 [Cucurbita maxima])
HSP 1 Score: 1949 bits (5050), Expect = 0.0
Identity = 971/1004 (96.71%), Positives = 985/1004 (98.11%), Query Frame = 0
Query: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ
Sbjct: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
Query: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY 120
PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGL NKNQDC+SNENKGESGTDSCY
Sbjct: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLLNKNQDCVSNENKGESGTDSCY 120
Query: 121 ADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
DVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVT+KHTVHASPEICGGLKLSSTS
Sbjct: 121 VDVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTKKHTVHASPEICGGLKLSSTS 180
Query: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSL 240
LNSDPHAGNKEEEIDVKVEGGAEVPVGLKED+KPKLVPEKSDMNFLKQNTKEH+LLDLSL
Sbjct: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDLKPKLVPEKSDMNFLKQNTKEHMLLDLSL 240
Query: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
NK ESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWES TSDAPVGQISSTQTNTAA
Sbjct: 241 NKPESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESSTSDAPVGQISSTQTNTAA 300
Query: 301 ETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYL 360
ETN CSSEMVESDNPCVKQ+FLDGE KGN INEC+PSNDHLHLSLNLSYPKPMLEEDPYL
Sbjct: 301 ETNACSSEMVESDNPCVKQSFLDGEPKGNCINECIPSNDHLHLSLNLSYPKPMLEEDPYL 360
Query: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD
Sbjct: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
Query: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV 480
KKI+SVG PNREGFTLGSLEQETEPENLNVRSEDDVHTTT SKS EQENE RCVE+VHAV
Sbjct: 421 KKIDSVGFPNREGFTLGSLEQETEPENLNVRSEDDVHTTTTSKSSEQENEDRCVEDVHAV 480
Query: 481 DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV 540
+NTSIEDVNRPVK AGRNQLSQYDERDNFE QDTADKAIDGIQEL+PAVSQGEVESAIAV
Sbjct: 481 ENTSIEDVNRPVKTAGRNQLSQYDERDNFEGQDTADKAIDGIQELVPAVSQGEVESAIAV 540
Query: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR
Sbjct: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
Query: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDT+
Sbjct: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTQ 660
Query: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQL DDEG YFFH
Sbjct: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLSDDEGRYFFH 720
Query: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP 780
G SRRKSPGRRHGP VHGGKMVNRIPRDFSPNRC+DEGGSFDRQHGEKFTRNFADDTEDP
Sbjct: 721 GSSRRKSPGRRHGPVVHGGKMVNRIPRDFSPNRCLDEGGSFDRQHGEKFTRNFADDTEDP 780
Query: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR
Sbjct: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
Query: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL
Sbjct: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
Query: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH
Sbjct: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
Query: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
+PVHSFKHPYDDSDGERFRNNGED SRPFRFC ENDSRI+WKRR
Sbjct: 961 DPVHSFKHPYDDSDGERFRNNGEDYSRPFRFCGENDSRIAWKRR 1004
BLAST of Cp4.1LG01g24100 vs. ExPASy TrEMBL
Match:
A0A6J1H386 (uncharacterized protein LOC111459702 OS=Cucurbita moschata OX=3662 GN=LOC111459702 PE=4 SV=1)
HSP 1 Score: 1978 bits (5125), Expect = 0.0
Identity = 985/1004 (98.11%), Positives = 996/1004 (99.20%), Query Frame = 0
Query: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ
Sbjct: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
Query: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY 120
PSPPKDELFHI+ANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDC+SNENKGESGTDSCY
Sbjct: 61 PSPPKDELFHINANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCVSNENKGESGTDSCY 120
Query: 121 ADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
DVVQSDSGMPVVKFQESSLG HVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS
Sbjct: 121 VDVVQSDSGMPVVKFQESSLGRHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
Query: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSL 240
LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNT+EHVLLDLSL
Sbjct: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTEEHVLLDLSL 240
Query: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA
Sbjct: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
Query: 301 ETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYL 360
ETNVCSSEMVESDNPCVKQ+FLDGEHKGNSINEC+P+NDHLHLSLNLSYPKPMLEEDPYL
Sbjct: 301 ETNVCSSEMVESDNPCVKQSFLDGEHKGNSINECIPANDHLHLSLNLSYPKPMLEEDPYL 360
Query: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEV+VHMCEKRGIETFDHADCDD
Sbjct: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVQVHMCEKRGIETFDHADCDD 420
Query: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV 480
KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKS EQENE RCVEEVHAV
Sbjct: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSSEQENEDRCVEEVHAV 480
Query: 481 DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV 540
+ TSIEDVNRPVKAAGRNQLSQYDERD+FE QDTADKAIDGIQ+LIPAVSQGEVESAIAV
Sbjct: 481 EYTSIEDVNRPVKAAGRNQLSQYDERDDFEGQDTADKAIDGIQQLIPAVSQGEVESAIAV 540
Query: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR
Sbjct: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
Query: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR
Sbjct: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
Query: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH
Sbjct: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
Query: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP 780
GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRC+DEGGSFDRQHGEKFTRNFADDTEDP
Sbjct: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCLDEGGSFDRQHGEKFTRNFADDTEDP 780
Query: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR
Sbjct: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
Query: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
SDMI RRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHG IRSL
Sbjct: 841 SDMIPRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGSIRSL 900
Query: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH
Sbjct: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
Query: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR
Sbjct: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
BLAST of Cp4.1LG01g24100 vs. ExPASy TrEMBL
Match:
A0A6J1JMD5 (uncharacterized protein LOC111487178 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487178 PE=4 SV=1)
HSP 1 Score: 1949 bits (5050), Expect = 0.0
Identity = 971/1004 (96.71%), Positives = 985/1004 (98.11%), Query Frame = 0
Query: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ
Sbjct: 1 MMAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQ 60
Query: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCY 120
PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGL NKNQDC+SNENKGESGTDSCY
Sbjct: 61 PSPPKDELFHIDANLTSERPSLSVTIVSSSSAVTSSGLLNKNQDCVSNENKGESGTDSCY 120
Query: 121 ADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTS 180
DVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVT+KHTVHASPEICGGLKLSSTS
Sbjct: 121 VDVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTKKHTVHASPEICGGLKLSSTS 180
Query: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSL 240
LNSDPHAGNKEEEIDVKVEGGAEVPVGLKED+KPKLVPEKSDMNFLKQNTKEH+LLDLSL
Sbjct: 181 LNSDPHAGNKEEEIDVKVEGGAEVPVGLKEDLKPKLVPEKSDMNFLKQNTKEHMLLDLSL 240
Query: 241 NKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAA 300
NK ESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWES TSDAPVGQISSTQTNTAA
Sbjct: 241 NKPESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESSTSDAPVGQISSTQTNTAA 300
Query: 301 ETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYL 360
ETN CSSEMVESDNPCVKQ+FLDGE KGN INEC+PSNDHLHLSLNLSYPKPMLEEDPYL
Sbjct: 301 ETNACSSEMVESDNPCVKQSFLDGEPKGNCINECIPSNDHLHLSLNLSYPKPMLEEDPYL 360
Query: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD
Sbjct: 361 SEYESDGNWDIAESVDDDDNNIEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 420
Query: 421 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV 480
KKI+SVG PNREGFTLGSLEQETEPENLNVRSEDDVHTTT SKS EQENE RCVE+VHAV
Sbjct: 421 KKIDSVGFPNREGFTLGSLEQETEPENLNVRSEDDVHTTTTSKSSEQENEDRCVEDVHAV 480
Query: 481 DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV 540
+NTSIEDVNRPVK AGRNQLSQYDERDNFE QDTADKAIDGIQEL+PAVSQGEVESAIAV
Sbjct: 481 ENTSIEDVNRPVKTAGRNQLSQYDERDNFEGQDTADKAIDGIQELVPAVSQGEVESAIAV 540
Query: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR
Sbjct: 541 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 600
Query: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 660
TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDT+
Sbjct: 601 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTQ 660
Query: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 720
PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQL DDEG YFFH
Sbjct: 661 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLSDDEGRYFFH 720
Query: 721 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP 780
G SRRKSPGRRHGP VHGGKMVNRIPRDFSPNRC+DEGGSFDRQHGEKFTRNFADDTEDP
Sbjct: 721 GSSRRKSPGRRHGPVVHGGKMVNRIPRDFSPNRCLDEGGSFDRQHGEKFTRNFADDTEDP 780
Query: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR
Sbjct: 781 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 840
Query: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL
Sbjct: 841 SDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQGFRYLSPSDDMMRDVGPAPDHGPIRSL 900
Query: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH
Sbjct: 901 IPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERH 960
Query: 961 EPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAENDSRISWKRR 1004
+PVHSFKHPYDDSDGERFRNNGED SRPFRFC ENDSRI+WKRR
Sbjct: 961 DPVHSFKHPYDDSDGERFRNNGEDYSRPFRFCGENDSRIAWKRR 1004
BLAST of Cp4.1LG01g24100 vs. ExPASy TrEMBL
Match:
A0A6J1BVM7 (uncharacterized protein LOC111006120 OS=Momordica charantia OX=3673 GN=LOC111006120 PE=4 SV=1)
HSP 1 Score: 1439 bits (3725), Expect = 0.0
Identity = 760/1044 (72.80%), Positives = 842/1044 (80.65%), Query Frame = 0
Query: 2 MAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNL------ 61
MA+PESEEV FKRIGLSA+DY A LPIKKRRFP+VQSPPSPSKDISSFHPDGNL
Sbjct: 1 MAMPESEEVCFKRIGLSASDYDACLPIKKRRFPLVQSPPSPSKDISSFHPDGNLVKTEQP 60
Query: 62 --------------MKIEQPSPPKD-ELFHIDANLT-SERPSLSVTIVSSSSAVTSSGLS 121
MK EQPSP KD FH D NL +E+P +S T VSSSSAVTS LS
Sbjct: 61 SSSKDISVNPNGNLMKTEQPSPSKDISSFHRDGNLIKTEQPGISATTVSSSSAVTSYELS 120
Query: 122 NKNQDCISNENKGESGTDSCYADVVQSDSGMPVVKFQES--SLGGHVS----LNGYVECE 181
NKNQ+C+ +ENKG+S TDSCY D QS GM VKFQE +LG ++ YVE E
Sbjct: 121 NKNQECVFDENKGKSDTDSCYLDRFQSKIGMAGVKFQEPQPNLGDRACFSDYVDDYVEYE 180
Query: 182 DKSLVTEKHTVHASPEICGGLKLSSTSLNSDPHAGNKEEEIDVK------------VEGG 241
+KSL+TEKHT+HAS EI GGLKLSSTSLN DP GN+EEEI VK VEG
Sbjct: 181 EKSLITEKHTLHASSEIPGGLKLSSTSLNFDPLTGNEEEEIAVKNPEEKRTSPICQVEGR 240
Query: 242 AEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSLNKQESGTHCVKGNAGSDYDGS 301
A + VGLK + PKLVPE +D+ LK + E VLLDLSL+KQ S + CV+G+ GSDYDGS
Sbjct: 241 AALSVGLKGHMVPKLVPENNDITLLKHSILEPVLLDLSLSKQGSSSPCVRGSIGSDYDGS 300
Query: 302 LLHSNRENWDLNTSMDSWESCTSDAPVGQISSTQTNTAAETNVCSSEMVESDNPCVKQTF 361
+LHSNRENWDLNTSM+SWE CTSDA QIS+TQTN T VCSSEMVE D+P KQ
Sbjct: 301 ILHSNRENWDLNTSMESWEGCTSDAAAVQISATQTNMDVGTYVCSSEMVEGDSPRGKQIP 360
Query: 362 LDGEHKGNSINECVPSNDHLHLSLNLSYPKPMLEEDPYLSEYESDGNWDIAESVDDDDNN 421
LD EH+ NSIN CVPS DHLHLSL+ SYPKP LEEDPYLS+YESDGNWD+A++VD DDNN
Sbjct: 361 LDSEHRDNSINACVPSKDHLHLSLHSSYPKPTLEEDPYLSDYESDGNWDLADAVDYDDNN 420
Query: 422 IEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDDKKINSVGLPNREGFTLGSLEQ 481
+EEDYEDGEVRETM ETEVEVH CEKR +E FDHADCDDKKIN VGLP+++ FTLG +EQ
Sbjct: 421 VEEDYEDGEVRETMLETEVEVHECEKREVERFDHADCDDKKINYVGLPDQDSFTLGLVEQ 480
Query: 482 ETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAVDNTSIEDVNRPVKAAGRNQLS 541
E +PE+L+VR+EDDVHT T KS EQENE C +EVHAV+NT EDVNRPVKA GR+QLS
Sbjct: 481 EAKPEHLDVRNEDDVHTATKCKSSEQENEDLCEKEVHAVENTINEDVNRPVKATGRSQLS 540
Query: 542 QYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAVDIVQNKDLILPSVKESVSSDD 601
YD++DNFE Q TADK IDGIQELI VSQ VE+AIAVD+VQNKD+ LP+VKESV+SDD
Sbjct: 541 LYDKQDNFEGQATADKNIDGIQELISTVSQ-HVENAIAVDVVQNKDVALPNVKESVNSDD 600
Query: 602 VKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSRTDREFVPSMALEGANVQPQER 661
KD+ G KNSRIINLNR S+DSTPCK KS FVR VLSRTDR+F+P+MALEGANVQPQER
Sbjct: 601 AKDVNGGIKNSRIINLNRASSDSTPCKVKSSFVRPVLSRTDRDFIPNMALEGANVQPQER 660
Query: 662 DDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTRPGEWDFGPNFSPETYTDQQID 721
D +G+T KK SVDR QD S W NFS RRGR++NRLDTR GEW+ GPNFSPETY+DQQID
Sbjct: 661 DHTFGNTNKKISVDRHQDPSPWMNFSRRRGRNSNRLDTRSGEWNLGPNFSPETYSDQQID 720
Query: 722 YHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFHGPSRRKSPGRRHGPGVHGGKM 781
YHVPGLDQNRY IIPDGPFGGA+HRGRQL DDEGP FFHGPSRRKSPGRRHG GGKM
Sbjct: 721 YHVPGLDQNRYDIIPDGPFGGASHRGRQLLDDEGP-FFHGPSRRKSPGRRHGG--QGGKM 780
Query: 782 VNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDPLYARPQLPYEVDRPFFRERRN 841
VNRI RDFSP+RCMDEGGSFDRQHGE FTRNF D T DP+YARPQ PYE DR FFRERRN
Sbjct: 781 VNRIHRDFSPSRCMDEGGSFDRQHGENFTRNFPDGTMDPIYARPQPPYEDDRSFFRERRN 840
Query: 842 FSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGRSDMIHRRPPSYRMDRIRSPDQ 901
FSFQRK FPRIDSKSPVRSRARSP QWF SKRSDRFCGR M HRR P+YR DR+RSPDQ
Sbjct: 841 FSFQRKGFPRIDSKSPVRSRARSPGQWFPSKRSDRFCGRPGMTHRRSPNYRTDRMRSPDQ 900
Query: 902 PPIRGRMAGRRQGFRYLSPSDDMMRDVGPAP-DHGPIRSLIPNRNQNERLPLRNRSFDAI 961
P+ G MA RR GFR++SPSDDM RDVGP P DHG +RS+IPNRNQ ERL LRNRS+D I
Sbjct: 901 RPMGGHMAARRHGFRFISPSDDM-RDVGPVPPDHGHMRSIIPNRNQTERLSLRNRSYDGI 960
Query: 962 DPRGRIESDELFDGPVRSGQLSGYNGGEHEDDERRFNERHEPVHSFKHPYDDSDGERFRN 1004
DPRGRIESDELFD PVRSGQLSG +GG H+DDER FNERHEP+HSFKHPYDDSDGERFRN
Sbjct: 961 DPRGRIESDELFDDPVRSGQLSGCSGGNHDDDERIFNERHEPLHSFKHPYDDSDGERFRN 1020
BLAST of Cp4.1LG01g24100 vs. ExPASy TrEMBL
Match:
A0A0A0KU39 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G504130 PE=4 SV=1)
HSP 1 Score: 1366 bits (3536), Expect = 0.0
Identity = 721/1030 (70.00%), Positives = 817/1030 (79.32%), Query Frame = 0
Query: 2 MAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQP 61
M + ESEEVGFKRIGLSA+DY A++PIKKRRFP VQ PSPSKDISSFH DGNL+K+EQP
Sbjct: 1 MTIAESEEVGFKRIGLSASDYEANIPIKKRRFPGVQLTPSPSKDISSFHSDGNLLKVEQP 60
Query: 62 SPPKD-ELFHIDANLT-SERPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSC 121
SPPKD F+ + NL SE P LSVT VSSSS VTS LSN NQD +S E KG+S TDSC
Sbjct: 61 SPPKDVSSFNHNENLIKSEEPILSVTTVSSSSVVTSCALSNNNQDSVSEEKKGKSDTDSC 120
Query: 122 YADVVQSDSGMPVVKFQESSLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSST 181
D+VQS+ G VKFQE SLG H +G+VECE KSLVT +HT HASP IC GLKL ST
Sbjct: 121 CVDIVQSNIGAAGVKFQEPSLGRHACTDGFVECEGKSLVTVEHTDHASPVICAGLKLLST 180
Query: 182 SLNSDPHAGNKEEEIDVKVE-----------GGAEVPVGLKEDVKPKLVPEKSDMNFLKQ 241
SL+SD AGNKEEEIDVK+ GGA V VGLK + KLV EKSD+NFLKQ
Sbjct: 181 SLDSDHFAGNKEEEIDVKMPEENCSPPICQLGGAGVLVGLKGHMDLKLVSEKSDLNFLKQ 240
Query: 242 NTKEHVLLDLSLNKQESGTHCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTS-DAP 301
N+ E VLL+ +LNKQ S T CVKGN G D DGS L SNRE WDLNTSM+SWE CTS DAP
Sbjct: 241 NSMEPVLLNFALNKQGSSTQCVKGNVGFDCDGSFLQSNREKWDLNTSMESWEGCTSGDAP 300
Query: 302 VGQISSTQTNTAAETNVCSSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNL 361
V QIS+T+TNT ET CSSEMVESD+PC KQT LD E KG+S E HLHLSL+
Sbjct: 301 VVQISATRTNTTIETYSCSSEMVESDSPCGKQTLLDNEDKGDSTKE------HLHLSLDS 360
Query: 362 SYPKPMLEEDPYLSEYESDGNWDIAESVDDDD-----------NNIEEDYEDGEVRETMP 421
SY K +L+EDPY+SEYESDGNWDIAE+VDD+D NN+EEDYEDGEVRETM
Sbjct: 361 SYLKSVLDEDPYISEYESDGNWDIAETVDDNDDNDDNDNDDNDNNVEEDYEDGEVRETMQ 420
Query: 422 ETEVEVHMCEKRGIETFDHADCDDKKINSVGLPNREGFTLGSLEQETEPENLNVRSEDD- 481
ETEVEVH+ EKR IE DHA C+DKKINSVGL + E FTLG +QET+ ENL+ RSED+
Sbjct: 421 ETEVEVHVYEKREIEPLDHAGCNDKKINSVGLLDHEFFTLGPKKQETKLENLDYRSEDED 480
Query: 482 -VHTTTISKSFEQENEGRCVEEVHAVDNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDT 541
V TTT S S+EQENE CV+E+HAV+N EDVN KA R+QLSQYD++ NFE Q T
Sbjct: 481 EVQTTTKSNSYEQENEDLCVKELHAVENAIGEDVNISAKATERSQLSQYDKKGNFEGQGT 540
Query: 542 ADKAIDGIQELIPAVSQGEVESAIAVDIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRI 601
ADK ++ +E +P SQ EVE+A+AVD+VQN+DL LP+VKESV+ DD KDI GT+NSRI
Sbjct: 541 ADKILN--EEPVPTFSQNEVENAVAVDVVQNRDLTLPTVKESVNEDDAKDINGGTRNSRI 600
Query: 602 INLNRGSADSTPCKEKSGFVRSVLSRTDREFVPSMALEGANVQPQERDDAYGDTTKKFSV 661
IN NR S DSTPCK KS F + VLS DREFVP+M +E AN++PQERDD Y + +KK S+
Sbjct: 601 INFNRTSTDSTPCKAKSNFAKPVLSHKDREFVPNMVVERANMKPQERDDVYSNISKKISI 660
Query: 662 DRSQDQSQWKNFSHRRGRSTNRLDTRPGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKI 721
D+ Q FSHRRGR+TNRLD R EWDFGPNFSPETY++QQIDYHV GLDQNRYKI
Sbjct: 661 DKRQGPPPLMGFSHRRGRNTNRLDNRSEEWDFGPNFSPETYSEQQIDYHVTGLDQNRYKI 720
Query: 722 IPDGPFGGANHRGRQLPDDEGPYFFHGPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRC 781
IPDGPFGGAN RGR+L +DE P+FFHGPSRRKSPGRRHG V GGKMVNR+PRDFSP RC
Sbjct: 721 IPDGPFGGANRRGRELVEDEEPFFFHGPSRRKSPGRRHGHSVRGGKMVNRMPRDFSPGRC 780
Query: 782 MDEGGSFDRQHGEKFTRNFADDTEDPLYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDS 841
MDEGGSFDRQHGEKFTRNFADDT D +Y RPQ PY+VDRPFFRERRNFSFQRK+FP+IDS
Sbjct: 781 MDEGGSFDRQHGEKFTRNFADDTVDEMYPRPQPPYDVDRPFFRERRNFSFQRKTFPKIDS 840
Query: 842 KSPVRSRARSPNQWFSSKRSDRFCGRSDMIHRRPPSYRMDRIRSPDQPPIRGRMAGRRQG 901
KSPVRSRARSP+QWFSSKRSDRFC R +M HRR P+Y DR+RSPDQ IRG M G+RQG
Sbjct: 841 KSPVRSRARSPSQWFSSKRSDRFCERPNMTHRRSPNYMTDRMRSPDQRSIRGYMPGQRQG 900
Query: 902 FRYLSPSDDMMRDVGPAPDHGPIRSLIPNRNQNERLPLRNRSFDAIDPRGRIESDELFDG 961
FRYLSP D++ RDVGPAPDHG +R IPNRNQ +RLPLRNRS+DAIDPRGRIE+D LF G
Sbjct: 901 FRYLSPPDEL-RDVGPAPDHGHMRPFIPNRNQTKRLPLRNRSYDAIDPRGRIENDGLFYG 960
Query: 962 PVRSGQLSGYNGGEHEDDERRFNERHEPVHSFKHPYDDSDGERFRNNGEDCSRPFRFCAE 1004
PVR GQL+GYNGGE +DDERRFNERHEP+HSFKH + DSDGER+RN GEDCSRPFRFCAE
Sbjct: 961 PVRLGQLTGYNGGEPDDDERRFNERHEPLHSFKHGFRDSDGERYRNKGEDCSRPFRFCAE 1020
BLAST of Cp4.1LG01g24100 vs. ExPASy TrEMBL
Match:
A0A0A0KNM2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G503580 PE=4 SV=1)
HSP 1 Score: 1351 bits (3496), Expect = 0.0
Identity = 721/1066 (67.64%), Positives = 822/1066 (77.11%), Query Frame = 0
Query: 2 MAVPESEEVGFKRIGLSATDYGASLPIKKRRFPVVQSPPSPSKDISSFHPDGNLMKIEQP 61
M + ESEEV FKR GLSA+DY ASLPIKKRRFPVVQ PPSPSKD+ SFH DGNL+K EQ
Sbjct: 1 MTLIESEEVRFKRTGLSASDYDASLPIKKRRFPVVQFPPSPSKDLPSFHSDGNLLKAEQL 60
Query: 62 SPPK-------------------------------------------DELFHIDANLTSE 121
SPPK H + + +E
Sbjct: 61 SPPKVSSSNCNESLIKTEQPSSPKEPSSFNSNESLLKTKQPSPSKDLSSFNHNENLIKTE 120
Query: 122 RPSLSVTIVSSSSAVTSSGLSNKNQDCISNENKGESGTDSCYADVVQSDSGMPVVKFQES 181
+P LS++IVSSSS VTSS L N +Q+ +S E KG+S TDSC D+VQSD G VKFQE
Sbjct: 121 QPILSMSIVSSSSVVTSSALLNNDQNNVSEEKKGKSDTDSCCEDIVQSDIGTAGVKFQEP 180
Query: 182 SLGGHVSLNGYVECEDKSLVTEKHTVHASPEICGGLKLSSTSLNSDPHAGNKEEEIDVK- 241
+LGGH ++ + E E KSLVT KHT+ SPEI GG SSTSL SDP AGNKEE IDVK
Sbjct: 181 TLGGHDYISCFDEYEGKSLVTVKHTIRKSPEIYGGSNRSSTSLYSDPLAGNKEEGIDVKM 240
Query: 242 -----------VEGGAEVPVGLKEDVKPKLVPEKSDMNFLKQNTKEHVLLDLSLNKQESG 301
V GGA V VGL + KLVPEKSD+NFLKQN+ E VLLDLSLNK S
Sbjct: 241 PEENCSPPICEVGGGAGVSVGLNCHMDLKLVPEKSDLNFLKQNSVEPVLLDLSLNKHGSS 300
Query: 302 THCVKGNAGSDYDGSLLHSNRENWDLNTSMDSWESCTS-DAPVGQISSTQTNTAAETNVC 361
T CVK N GSD DG LL NRE WDLNTSM+SWE CT D+PV Q+S+TQTNT ET+ C
Sbjct: 301 TQCVKDNVGSDCDGPLLQLNREKWDLNTSMESWEGCTGGDSPVVQMSATQTNTTIETHAC 360
Query: 362 SSEMVESDNPCVKQTFLDGEHKGNSINECVPSNDHLHLSLNLSYPKPM---LEEDPYLSE 421
SEMVESD+PC KQT LDGE KGNSI +C+PS ++L LSL+ SY KP+ LEEDPY+SE
Sbjct: 361 PSEMVESDSPCGKQTLLDGEDKGNSIYDCMPSKENLDLSLDSSYLKPVQPVLEEDPYISE 420
Query: 422 YESDGNWDIAESVDDDDNN--IEEDYEDGEVRETMPETEVEVHMCEKRGIETFDHADCDD 481
YESDGNWDIAE+VDDDDN+ +EEDYEDGEVRET+ E+EVEV EKR IE DHA CDD
Sbjct: 421 YESDGNWDIAEAVDDDDNDNHLEEDYEDGEVRETLQESEVEVLAYEKREIEPLDHAGCDD 480
Query: 482 KKINSVGLPNREGFTLGSLEQETEPENLNVRSEDDVHTTTISKSFEQENEGRCVEEVHAV 541
KKINS+ LP+ E LG LEQET+PENL++RSEDDV TTT SKS+EQENE CV+E+HAV
Sbjct: 481 KKINSIRLPDHELHALGPLEQETKPENLDLRSEDDVRTTTNSKSYEQENEDLCVKELHAV 540
Query: 542 DNTSIEDVNRPVKAAGRNQLSQYDERDNFEDQDTADKAIDGIQELIPAVSQGEVESAIAV 601
+NT DVN+ VK GR QL Q+D++ NFE QDTAD+ +D +ELIP SQGEVE+A+AV
Sbjct: 541 ENTISGDVNKAVKVTGRGQLFQFDKKHNFEAQDTADEMVD--EELIPTFSQGEVENAVAV 600
Query: 602 DIVQNKDLILPSVKESVSSDDVKDIYSGTKNSRIINLNRGSADSTPCKEKSGFVRSVLSR 661
D+VQN+DL LP+VKESV+ DD KDI GT+NSRIIN NR S DSTPCKEKS F RSVLS
Sbjct: 601 DVVQNRDLTLPTVKESVNEDDAKDINGGTRNSRIINFNRASIDSTPCKEKSSFSRSVLSH 660
Query: 662 TDREFVPSMALEGANVQPQERDDAYGDTTKKFSVDRSQDQSQWKNFSHRRGRSTNRLDTR 721
+REFVP+MA+EGAN+QPQERDDAY + TKK S+D+ + Q FSHRRGRS+NRLD R
Sbjct: 661 KEREFVPNMAVEGANMQPQERDDAYSNITKKISIDKREGQPPLMGFSHRRGRSSNRLDHR 720
Query: 722 PGEWDFGPNFSPETYTDQQIDYHVPGLDQNRYKIIPDGPFGGANHRGRQLPDDEGPYFFH 781
EWDFGPNFSPETY++QQIDYHVPGLDQNRYKI PDGPFGGAN RGR+L +DE P+FFH
Sbjct: 721 SEEWDFGPNFSPETYSEQQIDYHVPGLDQNRYKITPDGPFGGANRRGRELLEDEEPFFFH 780
Query: 782 GPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCMDEGGSFDRQHGEKFTRNFADDTEDP 841
GPSRRKS GRRHGP V GGKMV +IPRDFSP RCMDEGGSFDRQHGEKF+RNFADDT D
Sbjct: 781 GPSRRKSLGRRHGPNVGGGKMVYKIPRDFSPGRCMDEGGSFDRQHGEKFSRNFADDTVDL 840
Query: 842 LYARPQLPYEVDRPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPNQWFSSKRSDRFCGR 901
+Y RPQ PY++D+PFFRERRNFSFQRKSFPRIDSKSPVRSRARSP QWFSSKRSDRFC R
Sbjct: 841 MYPRPQPPYDIDKPFFRERRNFSFQRKSFPRIDSKSPVRSRARSPGQWFSSKRSDRFCER 900
Query: 902 SDMIHRRPPSYRMDRIRSPDQPPIRGRMA-GRRQGFRYLSPSDDMMRDVGPAPDHGPIRS 961
SDM HRR P+YR +R+RSPDQ PIRG M GRRQGF +LS SD+M RDVGPAPDHG +RS
Sbjct: 901 SDMTHRRSPNYRSERMRSPDQRPIRGHMPPGRRQGFHFLSASDEM-RDVGPAPDHGHMRS 960
Query: 962 LIPNRNQNERLPLRNRSFDAIDPRGRIESDELFDGP-VRSGQLSGYNGGEHEDDERRFNE 1004
+IP+RNQ ERLPLRNRS+DAIDP+GRIE+D+ F GP VR GQL+GYN G +DDERRFNE
Sbjct: 961 IIPDRNQTERLPLRNRSYDAIDPQGRIENDDFFYGPPVRLGQLTGYNDGVPDDDERRFNE 1020
BLAST of Cp4.1LG01g24100 vs. TAIR 10
Match:
AT5G13590.1 (unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; Bacteria - 8; Metazoa - 80; Fungi - 5; Plants - 17; Viruses - 0; Other Eukaryotes - 40 (source: NCBI BLink). )
HSP 1 Score: 77.8 bits (190), Expect = 5.5e-14
Identity = 90/294 (30.61%), Positives = 131/294 (44.56%), Query Frame = 0
Query: 698 GPFGGANHRGRQLPDDEG--PYFFHGPSRRKSPGRRHGPGVHGGKMVNRIPRDFSPNRCM 757
G F RGR+ P ++G PY P R+SP + G P
Sbjct: 826 GAFMSNFQRGRR-PANDGVTPYAHSFP--RRSPSFSYNRG---------------PTNKE 885
Query: 758 DEGGSFDRQHGEKFTRNFADDTEDPLYARPQLPYEVDRPFFRERRNF-SFQRKSFPRIDS 817
D + GEKFTR + +PL+ Q PY F R R F + ++ FP S
Sbjct: 886 DTSAFHGFRDGEKFTRGLQCNNTEPLFMNHQRPYRGRSGFARGRTKFVNNPKRDFPGFRS 945
Query: 818 KSPVRSRARS--PNQWFSSKRSDRFCGRSDMIHRRPPS-YRMDRIRSPDQPPIRGRMAGR 877
+SPVRSR RS + F ++ + F G +D HRR PS Y+++R+ SPD M R
Sbjct: 946 RSPVRSRERSDGSSSSFRNRSQEEFSGHTDFSHRRSPSGYKVERMSSPDHSGYSREMVVR 1005
Query: 878 RQGFRYLS--PSDDMMRDVGPAPDHGPIRSLIPNRNQN------ERLPLRNR-SFDAIDP 937
R S PS + R G A G +R R+ N + + RN + + +DP
Sbjct: 1006 RHNSPPFSHRPS-NAGRGRGYARGRGYVRGRGYGRDGNSFRKPSDHVVHRNHGNMNNLDP 1065
Query: 938 RGRIE-SDELFDGPVRSGQLSGYNGGEHEDDERRFNERHEPVHSFKHPYDDSDG 976
R R++ SD+ F+G + S + G + + RRF RH+ S P ++DG
Sbjct: 1066 RERVDYSDDFFEGQIHSERF----GVDVNAERRRFGYRHDGTSSSFRPSFNNDG 1096
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_023535366.1 | 0.0 | 100.00 | uncharacterized protein LOC111796821 [Cucurbita pepo subsp. pepo]
XP_022958498.1 | 0.0 | 98.11 | uncharacterized protein LOC111459702 [Cucurbita moschata]
KAG6602352.1 | 0.0 | 98.11 | hypothetical protein SDJN03_07585, partial [Cucurbita argyrosperma subsp. sorori...
KAG7033032.1 | 0.0 | 97.99 | hypothetical protein SDJN02_07085, partial [Cucurbita argyrosperma subsp. argyro...
XP_022990227.1 | 0.0 | 96.71 | uncharacterized protein LOC111487178 isoform X1 [Cucurbita maxima] >XP_022990228...
Match Name | E-value | Identity (%) | Description
A0A6J1H386 | 0.0 | 98.11 | uncharacterized protein LOC111459702 OS=Cucurbita moschata OX=3662 GN=LOC1114597...
A0A6J1JMD5 | 0.0 | 96.71 | uncharacterized protein LOC111487178 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1BVM7 | 0.0 | 72.80 | uncharacterized protein LOC111006120 OS=Momordica charantia OX=3673 GN=LOC111006...
A0A0A0KU39 | 0.0 | 70.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G504130 PE=4 SV=1
A0A0A0KNM2 | 0.0 | 67.64 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G503580 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT5G13590.1 | 5.5e-14 | 30.61 | unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; B...
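The Identity column in the tables above matches the percentage printed on each HSP statistics line (for example, 971/1004 gives 96.71). The following is a minimal Python sketch, not part of the database's own tooling, for recomputing those percentages from a statistics line. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the regular expression assumes the line layout shown in this report.

```python
import re

# Matches the per-HSP statistics line as printed in this report, e.g.
# "Identity = 971/1004 (96.71%), Positives = 985/1004 (98.11%), Query Frame = 0".
# "Pos\w+" is deliberately loose so that spelling variants of the label still match.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Recomputed from the raw counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Percentages exactly as printed, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }
```

Comparing `identity_pct` with `reported_identity_pct` for each hit is a quick consistency check when copying values from the alignments into a summary table.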