Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGATCTTGATCGGAGAGAAGAAAAACGGGACAATTAAAAATCCAAAGAACAGAGAAATTCAGATTACAGAGGCAATGGCCGTATCGGAGTCTGTAATTTGGCGGGGTGTTGAACAAACCTTGTGATCAATCTGCAACAAAACCCGGGAAGAAGAAGACGAAGAAGAACAAGAACAAGAGAGAAGAAGAAACCTCCAAAATCCCATCACAAATCACATCGCATTTCTCCACACTTCTCATCAAGCTCTCACCCATCTTCAAGATCTGTTCCCAATTCCAATTCCGCAACACACCCACTATTTGTTTCAAGGATTTTTCGTTTTGATTTCGATCGAACTCGTATTCGTTTTCGATTTCAATTGCGGGTTTTGCGTTCTCGTGGATACATTCTGAAACTCCCGAAGAAGACGTAAAGGAAGAGGAGAAGTGGCATCAGAAGCAGAGTCCAATGTTTGGATCTAGATTTTCCATTTTCGGAAGCGGTGCTGCTTCTGCTGCTGATAAGGTTGAGAAGTCGGCGAAGAGCGATGTTTTTCCTGGCCTGAAACTTCGCTCCGATAAAGATGTATATTGGCCTGGCGATCCGGTTGTTGTTACCATTGAAATTTCCTCCTCTGTCCCTCCATATGACTGCTCTCTTTTCATCGAGCGCCTTCGTTTTGAGATTATTGGCCTCCAGAAGTTGGACGCTCAGTGGTTTAGCACTCAGAAGCCAATCCCTGGATCCAAACAACGGAGAGGTTCCAATTCCCTTCTATATGATATTGTTCTTAGAATAGATATCTAAAAGAAGTTTGTTGATGGGTTTTCCCTGCTGATGTAGTTTCAATTTCATGTATTTTGGCTTTGTCAGGTGAACGCATTTTCATGGACTGCTCGGTACAATCAATAGTTTCAAATCAGATAATATCTTCGGGGGCCACTAAGTCATGTAAGCCTATGTGTTCTATGCCCTGCATTTATCTATCTCAGTTTCTGATATGCATGTTTTTCGCTTGAATATTGCTTATTTGGTACTTCGCCACCGTCGTTCCCTTTGTTTAATTCGTAGTACACTGTACGTTAGATGTCGTTATGACGTTCGGCCTACTATCAAATGACATGATGACCTTGGTGAAAGATTGTTCAGAATTATGAAGTAAATTTCTCGTTTCGAATTCTGAATTGCTGAAGTATTAATTTGAAGTATGATACAATTATTTGGAATTTCGATGGTAGACATTTTCAACTTCTGAATTCTGTGTCCCTTCACCTTGAATATGTCTCTTCTCCATAGAAGTTTGTCTAAAGCCATTTGTGGTAAAGGAATATCCAAAAAAAACTTGATGATTCGTTCTGTGAAACTTAGCCTCGAGAGATATTTACATAGGAGAGGTGCAACAAAGGAGAAATACTGCTCTGGAAATCTTGACAACTTGCTGTCCACCTTGCTATAGAGGATAAGAAACTCAAAATTATTGGTTTTTTCATTCCCAGTTTATGCAAGAGTTTTACTGTTTTTTGGTGCACCAAGCAATCTTTGGGCGGCAACTCTTTCGAGCTAAAAAGAAAGGTGTATCGGAATTGTGTGATCCAGAGGAAAAGTTTGACGGGAGAGTAATTATAGATTTTTTCATGATTAATCTAGAAATGAGTAGAAGTCGAAAACCACCTTGCATTCGAGTTTGTGAGAATTAAAGGACTTTCAGAAACAATGTTTAAACCACATACCGACCACCTATTTAAGATATTAATACTCCTAGCTATATGTTGGAGTAAATTGTCATTTCTTTTTGACTCATATTGGTTAGCTACTATTTTTTCCATTTGGTGAAGTTGCGGTATCTTTTCCTCTTTGTGGATGGTAAGAATTCCCAAGAAGGTTAAACTATTTAAGGGGCATGTACTTCATGGAAAAGTTAACACCTTGGATCAGGTCTTGAGGAAGTTGAGTTTCATCTGACTGAGGATCTTGATCACTTTGTGTGGAGATGTGATTACGATTTTCTCGGTTTTGAGGCGTTTGATTTTCACTTAGCTCAACCTAGAATTCATAGTTCCACGGTTTGAGGAGATTGAGCTCTTTCATCTGTCTTTTTGTGAAAAGGGGGCTCTTTTTTGTGGCTTGCTGAAGTTTGTTCAATTTGTGGGGCCTTTGAGTTTATAGGAACAATAAAATCTTTAGAGAGATAGAGAGACTTAGGACAGGGAGATTGATAGACTTAGGAGTAATGTGTGGCCCTTTGTTAGATTCTATGCGGACTTTACTTCTTAATTATTCTTTAGGTCTTATTTTGCTTGATCGACACCTCCTTTTGATTTTTCTTGGCTCTTTTTGGTTTTTTTTTTAATACTCTTTGTATTTGTATTCTTTCCTTTATTCTGAGTGGAAGCTTGGTTGTTTGTAAAAAAACATGAATATACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANNNNNNNNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCTAGAAATGAGGAGTTCATGATCATGATAGTTTTTAGGTCTCCTTTTGGAACTCTTTGGTGATTTTTCTTTAACTATGTTTCTTGTTTCCAATTAAAGCCCACTGGTGGAATTTTTGTAATGCCCTTGACTTCGGTTGATTCATCATCACCCTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTCTTTCTTTCAATCTAATGGAAGTTTCTTCTGTTTCCTATCAAACAAAACTGTGAGTGATATCTTATGAATGATTAGTTTTCCTAGATTGGTCGGAAATATGTCGAATTATAGTCATCGCTGAGTTTGTTTTTGTTTAGACTTGGATCCAGGAAGGCTTTCTGTTCAGCTTTGATGCTTCGAGAGACACCTCCTCCTTTTTTGGATAAGATTAAAGAAAAAATTCCAAACAGGGATGGAAATAGAAAGTGAAATCCACTACCAGAGGAGCTATGAAAAAGCTTTTACTTTGGAATGAATTAAAAACCAATACATCCAACACTGTTTGTCTTAAACTAAGGATTCATTTGTTTTTTACTCCATGCTTTAATTTACCATTTTGATTATGTACAGTAATTCTGAAAATGTTATTTGTTTTTACAGATGTGGTTCGAACAACACTTCCTAGCCGCATACCACCATCTTACAAGGGTGCCACTATTCGTTATATGTATTATGTTAAAAGTACATTACTAGGACGATGGCTAATACAGGAAAATGGTCGATCCCGTAAAGAGTCGCTGAAGGATCAAATAGAAATGGTTTGTTCAAAATTAACTTGCTTCCAAGATGAGTGAAAGTTCATTGTTTGGTTTGATGTGAATGAATATTAGGTGAACTGCAGCTAGGAATTTGTTTTTTACTTTGTTTTTCTTTGAAGCACGAACACTTTTGTGGTGCTTTCTAATCAAACTTTCTATGCCTGAACCCCCGTAGGTGTCTAATTTATCTGTTCTATATTATAATATGCACATTAAATAGGTATTTCTTTTTTAGCAAAATAAGCACAAGTTCTTTAATTTTTTTACAATGCATTACTTTTTCACTGGTATATTTATGTTTTCTCTACTTGTCCGCAATAAGATTATTTATCTTCTTATATATATATATATATATATATATATATATATTTATGAATTGCATTTTCAAATCTTGTGTGTTTAAAGATAGTTGTTAACGTGGATTTCTCAGTTACAGACAATAAATCGGTGCATGGGTTTATGATTGGACTAGAAAATTGGTGAAACAGTACCTGACCTTTGCCTCTCTTTGCACGTTTTTTATATTTCTGATGAAGATACTATCTAGCTTTATTTATTATCTATGTAGGAAGCTCGGGTTCCATTACAAGTGTGGGTCACTCAGAAAATCAGTGGCATGCTAATGGAAGAAGGTCAAAATGATGGTGATCTTTCTTCTCTCTGTTCTCTGTCGTTGGGAAAGTTGTATTTTTCATTGCATCCATTTAGAAGTTCAACGTGATTTCTGGAATTACACATCCATGGGAACCTTTGCATTTATGCAGGTAGCCCATATAGGTACTTGGTTATAGAGGGACTGGGCTTTATCTTCTCCAAATCGCTTGCTAACATGAATATAGGTTGAAGGAATAGGAGGCTGGTTAATGAGTAAATTTAATTCACCAATGACTAAATGGGTGCACAGGGACAGATTTTAAAATAAAGTTTATTGAATACATTTTAGAGTTATATTCATTTGATTATATATGTATTAGAGTATTCATGAATCACTATAATGATTAGTACATGTATTAGGTACTTAATATTTGGTACAGGGAGGATGTTCCAAAAGGTCTATTTAAGGACACTACGTAGGATTTCAAAAAGAACTCAACTTAAGAAGTGAAAACATCTCTTCCTCCAAAAAAAGACAGATTTGAGCGTTGAGAGATTTTTGCCCCCTTTTGCGTAAAATTTTACATTGCTCCAACAAGCTTGACTACGTTTTACAATAAACTAATGTTATCTACTGAGATTCGTACTAATCATGTTAACAATTATAAGAAGAAAGGAAACAATTTTCTTCAAGTACTACATAATCAAAGTTCATAATAATACTAAAGCAAAAAGAAGGGATTCAAGTGACTTATAGGATTCTGTTAGGGACTTTTGGGAATGCTGAGGTCTGAGGGGTTCAAGTGACCGATTGGTCTCTTAGGCATTTGGGTTCTTTTTTTTTTCTTTTTTTTGTTTTTTTTTTTTTTTTGTTGTAGTGTATTCTTTTAAGTTCCTTGTGTATTCCCATTTTTTATTCTTTTAAGTTCCTTGGGTTCCTTGTTGTAGTGTATTCCCATTTTTTGATTTCTATCAATTTGGAGCTTTTTTGAAATCCTTCTATTTTTTATTTGCTCTTCATTGAATTAATTTCTATGCACGTTATCTGTATATGATTGGACTAGAAAATTAGTGGTGAACCATATCAGTGGAAAAACTGAATTATTTAAAACCCTTGTTGTCTCTTGGACTCTGATACTTAACTACAAACAAAACTCAATATGATATGATGTTGTTGCATTTATTTGTTTAAGCCTTTCTATACTCATGTTCATATCATAATAAATGGATTTCTTCATTAACCATCCTCGTTTGACCCAATGGTCAAATGAATAAGCTATAAACCCATTTGTCTTTGAGGCCATTGATTCAGAATGTTGTAGGCATCAAGTATTTATTTTGTAAGGTATTCAGGATGTAGCCTATCTTAGGAACTCTTTGGATATCACTGTCGTAACGTCCGGTGATAGTACTGAGACACCAATATGATGAGCATGGGATTATGTATGTTTTCAGTTATATAGTATAGAATGCGAATTCACCCTTCGATTTCTCACCAGCTTTTTGGATTATTCATATAGTTTCTTTTCTGCACATACTATTACTGCTAGCGGACAGCTTGAGAGGTTCAAGCACTTCATGTCATTTAGAATCATTGAGACCACTTAAGAAACACTCTATTTATTATTCAATTTCTGTAATTTCTGTAATTTCTGTTTTTTTTTTTTAAATTTTATTTTAGTAGTATTAAATCAGTTTGGTTTCTACAACAAGATTATGGTAGAATGAGAATCAGTTTGCTTGTTTTTAGTAGTGTTTTAGTTTAAATGTTTTTCAATGGTTGTTTCTTCTGCAGCCTTTCAAATGGATGTATTTTGGAAAGAGATGGAAGGTGACACTGATTGGGTATGCCACTTACCAGGATTCATTGCTGTTAAACAAGTTTGTTTTATCATTTCATTTTGTTCTTGTCCTTGACGCTTTTGAATCAAATACTTTTCTTCATTTGTGATGTTCTGAAACAATTTTCAGATCTCTGAAAAACGATAATCGTAACCTTAGAGGGATATGTTTTTAATTTGGGTAGGGACAATTAGATTTTTAAATCAGGTTGAGATGAGGTGCGGATGTTTGACATTTAAAAATATAGTAGCGCATCGATGTTCTGTTCGTGAGGAATGTTAGATTTAAAAATTTTGAAATATAAAGAAAATTGTTGATCATTAATCATTTGCTACTCACATCCTCAGGTGCTTGTGAATTGAGTATATATCAGGCTGTAGTTAGTATATTCTTCAAAGTAATGACTGATGTTAGAGTTTTGAACCTCAGATATGAATACACAACCATTTCCTTGCAGGTCAGAGCAAATGATATATATGATGGCATTGATGAAGGATACGAAAGCTCAAGGGATGAGATCTCTTCTGTCTCATCATACAACCCAATGAGAGAACCTTTTCATAGAACCTTTGGAAGTTCACTGTCGTTTCAATCATCTGCTGCAAAATCTGGAAGATTTTCAATGAAGGATTCTCCTTTTACTGAAGGAGAGCGATTAAGTTTATCTTCTAATGTTGCACATCCTGGTCTCTCTGTTGCTGAGGTCCTATATGATTCCGCCGGTGGGTTTTTATGAACGAGCTATATTTTGCTCGAGGCATTTGGCTATAATCATTCTTTTTCATCTGCTTGTAAAGTCATCCCAATTAACGTTTTTGGTAGTAGAATCATTTCACTCTAGAACGGAGATAATAAGAGATTTAATTGTTATTTCGTTCAAGACAACGATTTTGACTAATAGGTTCTGTGGAAACAGATGGCACATCGCCTCAGAAGTCATCTGCAGTTGGAAGCCAGGCGTTGAACTTCGAGAAGAATCAGTCAAAAGATGATGATGCTGGAGTACCATCTTCGCCAAGGCCGAAAACTAATGAACCTGTAGCATGTAAATTATTAACTTTCTTGCCTCCCTTTTTAACTCTATTTTCTGATAAAAATCTTCTCGTTATCATTTAGTTTTATTTGTTGTCATGATGATATATCAAACAAATAATGGAAACTAATTACCTGCAAAAATTTCTTGCAGCAGAAGGCTTTATTCGAGGAAGATCTTACAATATCAGGGTAGATGACCAAGTTTTGCTTAGATTTTGCCCAAAAAATTCTGATTCAACCTATTATTTTAGTGATATGGTAAGTTTTTCTTCCAATATGACTAGGTTGAGCCCACTAATAGACATGAAGGTTAACTAGTTTCTTGTTTTTCAAAATTCACCAGATAGGTGGAACTCTTACCTTTTTCCATGAAGAAGGAGCAAGGAGATGCCTTGAGGTAACTATGGTGTTTAAATATGGACTAATTGGTTGGAGCATTAATAGGTCTTCTCATACAGGTTTCAATAGCTTTGGAGACTTCAGAGACTGTATCACGCCGTTTCGTCCATCCGTCCCGGAGGAACTCTCCAACAATTGTGAAGGTACTTTTTGAGAAGAATGTCGTATTTTGCTTGTCATTCTAAGAACTTAAGGGTTGAAATCACTTGACTTGTACTTGGGGGCGCTTTTGCTTGAGAAACCTTCAATTTTTTTGGCTCAGCCTTTTGGGAATATAGTTTATATCTTTTTCTTGGTGAAAGTCGGTTGAGGGGTGGTATAATGGGCGCATTATTTGTTTTAGTGGAGCAATGGGACCTCCATCTTGTGTTGTTTTCAGATGGGAGATGATGCCTTTAGGGATATCTGTTGCTTTCATTTTGTGTACGTCTTGAGCCTATTGTTCTTGACATATTATTGGGATTTGTAGGTTCAAAGTGATCATTATGAGGTTGTTGCTGATTTAATTCAGACAAGCTTTCTTTTCTCAATTCCCATGGATGGGCCAATGTCCTTTTCCACTCCACATGTATCTTTGCAGTGGGCACTTCGCTTTGAATTTTTTACCACCCCAAAGAATGTGGATTGGACCAGGTACTGACCCAAAAACTTCTTAAACTCGCCATCTTACGTAAATGAAATGGTAGTTTTTACCATTACCTGCCAGATACGACTGGACATTATGCCCCGGATCTTCGTTCTTATCACTTACCTGAGAATTTGTTGACGATTTTTCGTATGCTTTAATTCAAGTTTCACCCTTTCAGATATGAGCATCCTCTCATGATAGAAGGAAGAGAGAAAAGTGAATGGATTCTTCCAATCAATGTGCATGCTCCTCCATCTAGCTCTGCAGCTGCTCAGAACAGAAATGAGAGGTCTTTCTCTTTGGATCCCTTATGGATGCACAGCTGAGGTAACTGATTTAATTTGTTATATACCATGAAGCTGTATGTCTTCACTTTTTTCTTCATTTGTATCTCGAACCAATAACTGGAAAAAAACGTGTCGAATTCAAATTACTATAGGATCTTTTTAGACATGTGTTCTCCAAATATCTCCCTTTGTTCTGTAACAGCGCTTTCTCTTTTAGGCTTCCTCTCAAGGTTTTTGAAACGCGTCTACTAGGGAGAGGTTTTTGCACCTCCTATAAAGAGTATTTTGATCTCCTCCCCAATCCATGTGGGATTTCACAATCCACTCCCCTTTGAGGCCCAGTGTCCTCGCTGACACTCGCTCCTCTCTCCAATCGATGTGGGGTCTCACGATCCACCCCCCTTCGTGGCCTAGCGTCCTCGCTGGCACACGGCCCGGCGTTTGGCTCTGATACCATTTGTAACAACTCAAGCCCACTGTTAGCAGATATTGTCCTCTTTGGACTTTCCCTTTCGGGCTTCCTCTTAAGGTTTTTAAAACACGTTTGTTAGGGAGAGGTTTTCACACCCTTATAAAGAATTCTTCATTCTCCTCCCCATTTGAGGTGGGATCTCACAATCCACCCACCTTCGAGGCCCAACGTCCTCATTGGCACTCGTTCCTCTCTCCAATCGATGTGGGATCTCACGATCCACCCCCTTTGTGACCCAATGTCCTCGCTGGCATACTGCCTAGTGTCTGGCATTGATACCCTTTGTAACAACCCAACCTCACTGCTAGTAGATATTGTCCTCTTTTGGTTTTCCCTTTCGGGTGGAACCTCAAGGTTTTTAAAACACGTTTGTTAGGGAGAGGTTTCTACACCCTTATAAAAAATGTTTCGTTCTCCTCCCCAATCGATGTGGAAACTCACATGTCCCATCTTTATTTTTTCTCCATCTTATGATGAAGTTTCTTTTAAGGAACATGAAACAATAGCGAAGAATGTTGGGGTTCTTGTTGTGTGGTTTTGCACTACAAAAAGACGAGTGCTTTTATGACTAGGCTGCAGTTTTCTCATGATCACAATTGTGGTAGATGTTCTTCAAATAGTTCAAATAGGACTTTTAAGGCATTTGCGAATGCACTTGAAACCACAGCATACATATGGATCATCAATGGGTTAGTCTAGTAGTTTTCGAAGGCGATTGGACTCTGTGTGTCGGGTTTGATAAGGCTGTGTTGTGCATCACGCAGGTGCTTATGTTAAATGAAGTGCTTGTAGAATTCTCCAGCAATTTTAGCATCACCTCGACGTCTCGCTGCCAGGAAGTTTTCTGAAGAAACTAACATCTAGCAAATGCAGATGTCGAACTGAATCGCCCGAAGAAGATGATGGCGATGAGCGTATAGGCCATTGACGTTGCAGGTTGACTATGATATGTTGCAATTTATCGCCAGATGGTTCAGAGTATTTTTCCTTCAATTGTGCTATACATGTAAAGCAATATTATTTTGTGCTTGAAATTATTGTTCATGATCTTTCCTGAAAGCTTCCAATTTCAAGACAACTAAATAAATTAACATTTTTTTATAATATTAGTAAATG
mRNA sequence
CGATCTTGATCGGAGAGAAGAAAAACGGGACAATTAAAAATCCAAAGAACAGAGAAATTCAGATTACAGAGGCAATGGCCGTATCGGAGTCTGTAATTTGGCGGGGTGTTGAACAAACCTTGTGATCAATCTGCAACAAAACCCGGGAAGAAGAAGACGAAGAAGAACAAGAACAAGAGAGAAGAAGAAACCTCCAAAATCCCATCACAAATCACATCGCATTTCTCCACACTTCTCATCAAGCTCTCACCCATCTTCAAGATCTGTTCCCAATTCCAATTCCGCAACACACCCACTATTTGTTTCAAGGATTTTTCGTTTTGATTTCGATCGAACTCGTATTCGTTTTCGATTTCAATTGCGGGTTTTGCGTTCTCGTGGATACATTCTGAAACTCCCGAAGAAGACGTAAAGGAAGAGGAGAAGTGGCATCAGAAGCAGAGTCCAATGTTTGGATCTAGATTTTCCATTTTCGGAAGCGGTGCTGCTTCTGCTGCTGATAAGGTTGAGAAGTCGGCGAAGAGCGATGTTTTTCCTGGCCTGAAACTTCGCTCCGATAAAGATGTATATTGGCCTGGCGATCCGGTTGTTGTTACCATTGAAATTTCCTCCTCTGTCCCTCCATATGACTGCTCTCTTTTCATCGAGCGCCTTCGTTTTGAGATTATTGGCCTCCAGAAGTTGGACGCTCAGTGGTTTAGCACTCAGAAGCCAATCCCTGGATCCAAACAACGGAGAGGTGAACGCATTTTCATGGACTGCTCGGTACAATCAATAGTTTCAAATCAGATAATATCTTCGGGGGCCACTAAGTCATATGTGGTTCGAACAACACTTCCTAGCCGCATACCACCATCTTACAAGGGTGCCACTATTCGTTATATGTATTATGTTAAAAGTACATTACTAGGACGATGGCTAATACAGGAAAATGGTCGATCCCGTAAAGAGTCGCTGAAGGATCAAATAGAAATGGAAGCTCGGGTTCCATTACAAGTGTGGGTCACTCAGAAAATCAGTGGCATGCTAATGGAAGAAGGTCAAAATGATGGTAGCCCATATAGGTACTTGGTTATAGAGGGACTGGGCTTTATCTTCTCCAAATCGCTTGCTAACATGAATATAGCCTTTCAAATGGATGTATTTTGGAAAGAGATGGAAGGTGACACTGATTGGGTCAGAGCAAATGATATATATGATGGCATTGATGAAGGATACGAAAGCTCAAGGGATGAGATCTCTTCTGTCTCATCATACAACCCAATGAGAGAACCTTTTCATAGAACCTTTGGAAGTTCACTGTCGTTTCAATCATCTGCTGCAAAATCTGGAAGATTTTCAATGAAGGATTCTCCTTTTACTGAAGGAGAGCGATTAAGTTTATCTTCTAATGTTGCACATCCTGGTCTCTCTGTTGCTGAGGTCCTATATGATTCCGCCGGTTCTGTGGAAACAGATGGCACATCGCCTCAGAAGTCATCTGCAGTTGGAAGCCAGGCGTTGAACTTCGAGAAGAATCAGTCAAAAGATGATGATGCTGGAGTACCATCTTCGCCAAGGCCGAAAACTAATGAACCTGTAGCATCAGAAGGCTTTATTCGAGGAAGATCTTACAATATCAGGGTAGATGACCAAGTTTTGCTTAGATTTTGCCCAAAAAATTCTGATTCAACCTATTATTTTAGTGATATGATAGGTGGAACTCTTACCTTTTTCCATGAAGAAGGAGCAAGGAGATGCCTTGAGGTTTCAATAGCTTTGGAGACTTCAGAGACTGTATCACGCCGTTTCGTCCATCCGTCCCGGAGGAACTCTCCAACAATTGTGAAGGTTCAAAGTGATCATTATGAGGTTGTTGCTGATTTAATTCAGACAAGCTTTCTTTTCTCAATTCCCATGGATGGGCCAATGTCCTTTTCCACTCCACATGTATCTTTGCAGTGGGCACTTCGCTTTGAATTTTTTACCACCCCAAAGAATGTGGATTGGACCAGATATGAGCATCCTCTCATGATAGAAGGAAGAGAGAAAAGTGAATGGATTCTTCCAATCAATGTGCATGCTCCTCCATCTAGCTCTGCAGCTGCTCAGAACAGAAATGAGAGGTCTTTCTCTTTGGATCCCTTATGGATGCACAGCTGAGGTGCTTATGTTAAATGAAGTGCTTGTAGAATTCTCCAGCAATTTTAGCATCACCTCGACGTCTCGCTGCCAGGAAGTTTTCTGAAGAAACTAACATCTAGCAAATGCAGATGTCGAACTGAATCGCCCGAAGAAGATGATGGCGATGAGCGTATAGGCCATTGACGTTGCAGGTTGACTATGATATGTTGCAATTTATCGCCAGATGGTTCAGAGTATTTTTCCTTCAATTGTGCTATACATGTAAAGCAATATTATTTTGTGCTTGAAATTATTGTTCATGATCTTTCCTGAAAGCTTCCAATTTCAAGACAACTAAATAAATTAACATTTTTTTATAATATTAGTAAATG
Coding sequence (CDS)
ATGTTTGGATCTAGATTTTCCATTTTCGGAAGCGGTGCTGCTTCTGCTGCTGATAAGGTTGAGAAGTCGGCGAAGAGCGATGTTTTTCCTGGCCTGAAACTTCGCTCCGATAAAGATGTATATTGGCCTGGCGATCCGGTTGTTGTTACCATTGAAATTTCCTCCTCTGTCCCTCCATATGACTGCTCTCTTTTCATCGAGCGCCTTCGTTTTGAGATTATTGGCCTCCAGAAGTTGGACGCTCAGTGGTTTAGCACTCAGAAGCCAATCCCTGGATCCAAACAACGGAGAGGTGAACGCATTTTCATGGACTGCTCGGTACAATCAATAGTTTCAAATCAGATAATATCTTCGGGGGCCACTAAGTCATATGTGGTTCGAACAACACTTCCTAGCCGCATACCACCATCTTACAAGGGTGCCACTATTCGTTATATGTATTATGTTAAAAGTACATTACTAGGACGATGGCTAATACAGGAAAATGGTCGATCCCGTAAAGAGTCGCTGAAGGATCAAATAGAAATGGAAGCTCGGGTTCCATTACAAGTGTGGGTCACTCAGAAAATCAGTGGCATGCTAATGGAAGAAGGTCAAAATGATGGTAGCCCATATAGGTACTTGGTTATAGAGGGACTGGGCTTTATCTTCTCCAAATCGCTTGCTAACATGAATATAGCCTTTCAAATGGATGTATTTTGGAAAGAGATGGAAGGTGACACTGATTGGGTCAGAGCAAATGATATATATGATGGCATTGATGAAGGATACGAAAGCTCAAGGGATGAGATCTCTTCTGTCTCATCATACAACCCAATGAGAGAACCTTTTCATAGAACCTTTGGAAGTTCACTGTCGTTTCAATCATCTGCTGCAAAATCTGGAAGATTTTCAATGAAGGATTCTCCTTTTACTGAAGGAGAGCGATTAAGTTTATCTTCTAATGTTGCACATCCTGGTCTCTCTGTTGCTGAGGTCCTATATGATTCCGCCGGTTCTGTGGAAACAGATGGCACATCGCCTCAGAAGTCATCTGCAGTTGGAAGCCAGGCGTTGAACTTCGAGAAGAATCAGTCAAAAGATGATGATGCTGGAGTACCATCTTCGCCAAGGCCGAAAACTAATGAACCTGTAGCATCAGAAGGCTTTATTCGAGGAAGATCTTACAATATCAGGGTAGATGACCAAGTTTTGCTTAGATTTTGCCCAAAAAATTCTGATTCAACCTATTATTTTAGTGATATGATAGGTGGAACTCTTACCTTTTTCCATGAAGAAGGAGCAAGGAGATGCCTTGAGGTTTCAATAGCTTTGGAGACTTCAGAGACTGTATCACGCCGTTTCGTCCATCCGTCCCGGAGGAACTCTCCAACAATTGTGAAGGTTCAAAGTGATCATTATGAGGTTGTTGCTGATTTAATTCAGACAAGCTTTCTTTTCTCAATTCCCATGGATGGGCCAATGTCCTTTTCCACTCCACATGTATCTTTGCAGTGGGCACTTCGCTTTGAATTTTTTACCACCCCAAAGAATGTGGATTGGACCAGATATGAGCATCCTCTCATGATAGAAGGAAGAGAGAAAAGTGAATGGATTCTTCCAATCAATGTGCATGCTCCTCCATCTAGCTCTGCAGCTGCTCAGAACAGAAATGAGAGGTCTTTCTCTTTGGATCCCTTATGGATGCACAGCTGA
Protein sequence
MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPYDCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGATKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARVPLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGDTDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMKDSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQSKDDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAPPSSSAAAQNRNERSFSLDPLWMHS
Homology
BLAST of Cp4.1LG17g02630 vs. NCBI nr
Match:
XP_023514696.1 (uncharacterized protein LOC111778917 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1045 bits (2701), Expect = 0.0
Identity = 534/564 (94.68%), Postives = 534/564 (94.68%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY
Sbjct: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV
Sbjct: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQKISGMLMEEGQND AFQMDVFWKEMEGD
Sbjct: 181 PLQVWVTQKISGMLMEEGQND-------------------------AFQMDVFWKEMEGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK
Sbjct: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQSK 360
DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA DGTSPQKSSAVGSQALNFEKNQSK
Sbjct: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA-----DGTSPQKSSAVGSQALNFEKNQSK 360
Query: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL
Sbjct: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
Query: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF
Sbjct: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
Query: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 540
SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP
Sbjct: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 534
Query: 541 PSSSAAAQNRNERSFSLDPLWMHS 564
PSSSAAAQNRNERSFSLDPLWMHS
Sbjct: 541 PSSSAAAQNRNERSFSLDPLWMHS 534
BLAST of Cp4.1LG17g02630 vs. NCBI nr
Match:
KAG7026066.1 (hypothetical protein SDJN02_12564 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1031 bits (2665), Expect = 0.0
Identity = 528/564 (93.62%), Postives = 530/564 (93.97%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIF SGAA+AADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVP +
Sbjct: 1 MFGSRFSIFRSGAAAAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPQF 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSR ESLKDQIEMEARV
Sbjct: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRIESLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQKISGMLMEEGQND AFQMDVFWKEMEGD
Sbjct: 181 PLQVWVTQKISGMLMEEGQND-------------------------AFQMDVFWKEMEGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHR FGSSLSFQSSAAKSGRFSMK
Sbjct: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRNFGSSLSFQSSAAKSGRFSMK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQSK 360
DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA DGTSPQKSSAVGSQALNFEKNQSK
Sbjct: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA-----DGTSPQKSSAVGSQALNFEKNQSK 360
Query: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL
Sbjct: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
Query: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF
Sbjct: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
Query: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 540
SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP
Sbjct: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 534
Query: 541 PSSSAAAQNRNERSFSLDPLWMHS 564
PSSSAAAQNRNERSFSLDPLWMHS
Sbjct: 541 PSSSAAAQNRNERSFSLDPLWMHS 534
BLAST of Cp4.1LG17g02630 vs. NCBI nr
Match:
XP_022964483.1 (uncharacterized protein LOC111464493 [Cucurbita moschata])
HSP 1 Score: 1030 bits (2663), Expect = 0.0
Identity = 527/564 (93.44%), Postives = 529/564 (93.79%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIFGSGAA+AADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVP +
Sbjct: 1 MFGSRFSIFGSGAAAAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPQF 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSR ESLKDQIEMEARV
Sbjct: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRIESLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQKISGMLMEEGQND AFQMDVFWKEMEGD
Sbjct: 181 PLQVWVTQKISGMLMEEGQND-------------------------AFQMDVFWKEMEGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDWVRANDIYDGIDEGYESSRDE SSVSSYNPMREPFHR FGSSLSFQSSAAK GRFSMK
Sbjct: 241 TDWVRANDIYDGIDEGYESSRDEFSSVSSYNPMREPFHRNFGSSLSFQSSAAKFGRFSMK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQSK 360
DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA DGTSPQKSSAVGSQALNFEKNQSK
Sbjct: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA-----DGTSPQKSSAVGSQALNFEKNQSK 360
Query: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL
Sbjct: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
Query: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF
Sbjct: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
Query: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 540
SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP
Sbjct: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 534
Query: 541 PSSSAAAQNRNERSFSLDPLWMHS 564
PSSSAAAQNRNERSFSLDPLWMHS
Sbjct: 541 PSSSAAAQNRNERSFSLDPLWMHS 534
BLAST of Cp4.1LG17g02630 vs. NCBI nr
Match:
XP_022999897.1 (uncharacterized protein LOC111494236 [Cucurbita maxima])
HSP 1 Score: 1019 bits (2635), Expect = 0.0
Identity = 522/564 (92.55%), Postives = 527/564 (93.44%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIFGSGAA+AADKV+KSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVP +
Sbjct: 1 MFGSRFSIFGSGAAAAADKVQKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPQF 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSYVVRTTLPSRIPPSYK ATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV
Sbjct: 121 TKSYVVRTTLPSRIPPSYKSATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQKISGMLMEEGQND AFQMDVFWKEMEGD
Sbjct: 181 PLQVWVTQKISGMLMEEGQND-------------------------AFQMDVFWKEMEGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDWVRANDIYDGIDEGY+SSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFS+K
Sbjct: 241 TDWVRANDIYDGIDEGYDSSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSIK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQSK 360
DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA D TSP KSSAVG QALNFEKNQSK
Sbjct: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA-----DVTSPPKSSAVGGQALNFEKNQSK 360
Query: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
DDDAGVPSSPR KTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL
Sbjct: 361 DDDAGVPSSPRLKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
Query: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF
Sbjct: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
Query: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 540
SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP
Sbjct: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 534
Query: 541 PSSSAAAQNRNERSFSLDPLWMHS 564
PSSSAAAQNRNERS SLDPLWMHS
Sbjct: 541 PSSSAAAQNRNERSLSLDPLWMHS 534
BLAST of Cp4.1LG17g02630 vs. NCBI nr
Match:
KAG6593727.1 (hypothetical protein SDJN03_13203, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 959 bits (2480), Expect = 0.0
Identity = 496/544 (91.18%), Postives = 502/544 (92.28%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIF SGAA+AADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVP +
Sbjct: 1 MFGSRFSIFRSGAAAAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPQF 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSR ESLKDQIEMEARV
Sbjct: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRIESLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQKISGMLMEEGQND AFQMDVFWKEMEGD
Sbjct: 181 PLQVWVTQKISGMLMEEGQND-------------------------AFQMDVFWKEMEGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHR FGSSLSFQSSAAKSGRFSMK
Sbjct: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRNFGSSLSFQSSAAKSGRFSMK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQSK 360
DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA DGTSPQKSSAVGSQALNFEKNQSK
Sbjct: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA-----DGTSPQKSSAVGSQALNFEKNQSK 360
Query: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL
Sbjct: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
Query: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF
Sbjct: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
Query: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 540
SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREK+E ++ V
Sbjct: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKTEVLMSNEVRIE 514
Query: 541 PSSS 544
SS+
Sbjct: 541 FSSN 514
BLAST of Cp4.1LG17g02630 vs. ExPASy TrEMBL
Match:
A0A6J1HKY7 (uncharacterized protein LOC111464493 OS=Cucurbita moschata OX=3662 GN=LOC111464493 PE=4 SV=1)
HSP 1 Score: 1030 bits (2663), Expect = 0.0
Identity = 527/564 (93.44%), Postives = 529/564 (93.79%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIFGSGAA+AADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVP +
Sbjct: 1 MFGSRFSIFGSGAAAAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPQF 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSR ESLKDQIEMEARV
Sbjct: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRIESLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQKISGMLMEEGQND AFQMDVFWKEMEGD
Sbjct: 181 PLQVWVTQKISGMLMEEGQND-------------------------AFQMDVFWKEMEGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDWVRANDIYDGIDEGYESSRDE SSVSSYNPMREPFHR FGSSLSFQSSAAK GRFSMK
Sbjct: 241 TDWVRANDIYDGIDEGYESSRDEFSSVSSYNPMREPFHRNFGSSLSFQSSAAKFGRFSMK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQSK 360
DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA DGTSPQKSSAVGSQALNFEKNQSK
Sbjct: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA-----DGTSPQKSSAVGSQALNFEKNQSK 360
Query: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL
Sbjct: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
Query: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF
Sbjct: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
Query: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 540
SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP
Sbjct: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 534
Query: 541 PSSSAAAQNRNERSFSLDPLWMHS 564
PSSSAAAQNRNERSFSLDPLWMHS
Sbjct: 541 PSSSAAAQNRNERSFSLDPLWMHS 534
BLAST of Cp4.1LG17g02630 vs. ExPASy TrEMBL
Match:
A0A6J1KC17 (uncharacterized protein LOC111494236 OS=Cucurbita maxima OX=3661 GN=LOC111494236 PE=4 SV=1)
HSP 1 Score: 1019 bits (2635), Expect = 0.0
Identity = 522/564 (92.55%), Postives = 527/564 (93.44%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIFGSGAA+AADKV+KSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVP +
Sbjct: 1 MFGSRFSIFGSGAAAAADKVQKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPQF 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSYVVRTTLPSRIPPSYK ATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV
Sbjct: 121 TKSYVVRTTLPSRIPPSYKSATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQKISGMLMEEGQND AFQMDVFWKEMEGD
Sbjct: 181 PLQVWVTQKISGMLMEEGQND-------------------------AFQMDVFWKEMEGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDWVRANDIYDGIDEGY+SSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFS+K
Sbjct: 241 TDWVRANDIYDGIDEGYDSSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSIK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQSK 360
DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA D TSP KSSAVG QALNFEKNQSK
Sbjct: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSA-----DVTSPPKSSAVGGQALNFEKNQSK 360
Query: 361 DDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
DDDAGVPSSPR KTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL
Sbjct: 361 DDDAGVPSSPRLKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTL 420
Query: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF
Sbjct: 421 TFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLF 480
Query: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 540
SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP
Sbjct: 481 SIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAP 534
Query: 541 PSSSAAAQNRNERSFSLDPLWMHS 564
PSSSAAAQNRNERS SLDPLWMHS
Sbjct: 541 PSSSAAAQNRNERSLSLDPLWMHS 534
BLAST of Cp4.1LG17g02630 vs. ExPASy TrEMBL
Match:
A0A1S4E0M0 (uncharacterized protein LOC103495739 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495739 PE=4 SV=1)
HSP 1 Score: 897 bits (2319), Expect = 0.0
Identity = 468/566 (82.69%), Postives = 490/566 (86.57%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGS+FSIFG+G +AADKVEKSAKS+ FPGLKLRSDKDVY PGDPVVVTIEI SSVP
Sbjct: 1 MFGSKFSIFGTG--TAADKVEKSAKSEFFPGLKLRSDKDVYRPGDPVVVTIEICSSVPQL 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSL IERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGE IFMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLLIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGEHIFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
KSYVVR+ LP+ IPPSYKGATIRYMY VKSTL+GRWL QEN RS KES DQIEMEARV
Sbjct: 121 MKSYVVRSMLPTCIPPSYKGATIRYMYCVKSTLVGRWLSQENCRSHKESPMDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQK +GMLMEEGQND AFQMDVFWKEME D
Sbjct: 181 PLQVWVTQKTNGMLMEEGQND-------------------------AFQMDVFWKEMESD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDW+RANDIY GIDEGY+SSRDEISSVSSYNPMREPFHRTFGSSLS QSSA GR S+K
Sbjct: 241 TDWIRANDIYAGIDEGYDSSRDEISSVSSYNPMREPFHRTFGSSLSLQSSA---GRSSIK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAV--GSQALNFEKNQ 360
+PF EGERLSLSSNVA P +SVAEVLY+S TD SPQKS A SQ LNFEKNQ
Sbjct: 301 IAPFIEGERLSLSSNVARPRVSVAEVLYES-----TDVASPQKSFAAVSPSQVLNFEKNQ 360
Query: 361 SKDDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGG 420
S DDDAGV +SPRPKT EPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGG
Sbjct: 361 STDDDAGVATSPRPKTIEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGG 420
Query: 421 TLTFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSF 480
TLTFFHEEG RRCLE+SI LETSETVSRRF+HPSRRNSPTIVKVQSDHYEVVADLIQTSF
Sbjct: 421 TLTFFHEEGTRRCLELSITLETSETVSRRFIHPSRRNSPTIVKVQSDHYEVVADLIQTSF 480
Query: 481 LFSIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVH 540
LFSIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPL+IEGREKSEW+LPI VH
Sbjct: 481 LFSIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLLIEGREKSEWVLPITVH 531
Query: 541 APPSSSAAAQNRNERSFSLDPLWMHS 564
APPSS+A AQNRN+R FSL+PLWMHS
Sbjct: 541 APPSSAATAQNRNDRPFSLEPLWMHS 531
BLAST of Cp4.1LG17g02630 vs. ExPASy TrEMBL
Match:
A0A6J1L132 (uncharacterized protein LOC111499409 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111499409 PE=4 SV=1)
HSP 1 Score: 868 bits (2244), Expect = 1.20e-314
Identity = 456/562 (81.14%), Postives = 481/562 (85.59%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIFG GAA A+KVE+S KS+V PG +LRSDKDVY PGDPVVVTIEI SSV
Sbjct: 1 MFGSRFSIFGIGAA--AEKVEESVKSEVLPGFELRSDKDVYRPGDPVVVTIEICSSVAQL 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSL IERLRFEIIGL+KLDAQWFSTQKPIPGS+QRRGE +FMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLLIERLRFEIIGLRKLDAQWFSTQKPIPGSRQRRGEHVFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSY VRTTLPS IPPSYKGATIRYMYYVKSTLLGRWL QENGRS KE LKDQIEMEARV
Sbjct: 121 TKSYEVRTTLPSCIPPSYKGATIRYMYYVKSTLLGRWLTQENGRSHKELLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQK +G+LMEEGQN+ AFQMDVFWKEM+GD
Sbjct: 181 PLQVWVTQKTNGVLMEEGQNE-------------------------AFQMDVFWKEMKGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
TDW+RANDIYD IDEGY+SSRDEISSVSSYNPMREPFHRTFGSSLS QSSA GR S+K
Sbjct: 241 TDWIRANDIYDCIDEGYDSSRDEISSVSSYNPMREPFHRTFGSSLSLQSSA---GRSSIK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAV--GSQALNFEKNQ 360
D+PF EGERLSL NVA P +SVAEVLYDSA D S QKS A SQAL+FEKNQ
Sbjct: 301 DAPFIEGERLSLFPNVARPRVSVAEVLYDSA-----DVASSQKSFAAVSPSQALSFEKNQ 360
Query: 361 SKDDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGG 420
DDD GV SSP PK EPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGG
Sbjct: 361 LTDDDVGVASSPMPKIIEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGG 420
Query: 421 TLTFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSF 480
TLTFFHE+GARRCLEVSI LETSETVSRRFVHPSRRNSPTIVKVQSDH+EVVADLIQTSF
Sbjct: 421 TLTFFHEDGARRCLEVSITLETSETVSRRFVHPSRRNSPTIVKVQSDHFEVVADLIQTSF 480
Query: 481 LFSIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVH 540
LFSIP++GPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPL+IE REKSEWILPI VH
Sbjct: 481 LFSIPINGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLLIETREKSEWILPITVH 527
Query: 541 APPSSSAAAQNRNERSFSLDPL 560
APPSS+A AQNRN+R F L+PL
Sbjct: 541 APPSSTATAQNRNDRPFPLEPL 527
BLAST of Cp4.1LG17g02630 vs. ExPASy TrEMBL
Match:
A0A6J1H364 (uncharacterized protein LOC111460073 OS=Cucurbita moschata OX=3662 GN=LOC111460073 PE=4 SV=1)
HSP 1 Score: 868 bits (2243), Expect = 1.76e-314
Identity = 454/562 (80.78%), Postives = 480/562 (85.41%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPPY 60
MFGSRFSIFG GAA A+KVE+S KS+V PG +LR DKDVY PGDPVVVT+EI SSV
Sbjct: 1 MFGSRFSIFGVGAA--AEKVEESVKSEVLPGFELRCDKDVYRPGDPVVVTVEICSSVAQL 60
Query: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
DCSL IERLRFEIIGL+KLDAQWFSTQKPIPGS+QRRGE +FMDCSVQSIVSNQIISSGA
Sbjct: 61 DCSLLIERLRFEIIGLRKLDAQWFSTQKPIPGSRQRRGEHVFMDCSVQSIVSNQIISSGA 120
Query: 121 TKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
TKSY VRT LPSRIPPSYKGATIRYMYYVKSTLLG+WL QENGRS KESLKDQIEMEARV
Sbjct: 121 TKSYEVRTMLPSRIPPSYKGATIRYMYYVKSTLLGQWLTQENGRSHKESLKDQIEMEARV 180
Query: 181 PLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEGD 240
PLQVWVTQK +GMLMEEGQND AFQMDVFWKEM+GD
Sbjct: 181 PLQVWVTQKTNGMLMEEGQND-------------------------AFQMDVFWKEMKGD 240
Query: 241 TDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSMK 300
DW+RANDIYDGIDEGY+SSRDEISSVSSYNPMREPF RTFGSSLS QSSA GR S+K
Sbjct: 241 ADWIRANDIYDGIDEGYDSSRDEISSVSSYNPMREPFLRTFGSSLSLQSSA---GRSSIK 300
Query: 301 DSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAV--GSQALNFEKNQ 360
D+PF EGERLSLS NVA P +SVAEVLYDSA D S QKS A SQAL+FEKNQ
Sbjct: 301 DAPFIEGERLSLSPNVARPRVSVAEVLYDSA-----DVASSQKSFAAVSPSQALSFEKNQ 360
Query: 361 SKDDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGG 420
DDD GV SSP PK EPVASEGFIRGRSYNIRVD+QVLLRFCPKNSDSTYYFSDMIGG
Sbjct: 361 LTDDDVGVASSPMPKIIEPVASEGFIRGRSYNIRVDEQVLLRFCPKNSDSTYYFSDMIGG 420
Query: 421 TLTFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSF 480
TLTFFHE+GARRCLE SI LETSETVSRRFVHPSRRNSPTIVKVQSDH+EVVADLIQTSF
Sbjct: 421 TLTFFHEDGARRCLEASITLETSETVSRRFVHPSRRNSPTIVKVQSDHFEVVADLIQTSF 480
Query: 481 LFSIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVH 540
LFSIP++GPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPL+IE REKSEWILPI VH
Sbjct: 481 LFSIPINGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLLIEAREKSEWILPITVH 527
Query: 541 APPSSSAAAQNRNERSFSLDPL 560
APPSS+A AQNRN+R F L+PL
Sbjct: 541 APPSSTATAQNRNDRPFPLEPL 527
BLAST of Cp4.1LG17g02630 vs. TAIR 10
Match:
AT1G50120.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Rgp1 (InterPro:IPR014848), Immunoglobulin E-set (InterPro:IPR014756); Has 144 Blast hits to 140 proteins in 61 species: Archae - 0; Bacteria - 0; Metazoa - 86; Fungi - 10; Plants - 39; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink). )
HSP 1 Score: 609.8 bits (1571), Expect = 2.3e-174
Identity = 323/565 (57.17%), Postives = 399/565 (70.62%), Query Frame = 0
Query: 1 MFGSRFSIFGSGAASAADKVEKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVP-P 60
M SRFS G G++S + + S + P L +++DKDVY PGD + VTIE+++S
Sbjct: 1 MLSSRFSFLGIGSSSEVNDSVGVSGSKIKPSLSVQTDKDVYRPGDSIFVTIEVANSHDNA 60
Query: 61 YDCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSG 120
+ S+ +E+L FE+ GL+KLD QWFSTQKP PGSK RRGE IF+D S S++SNQI+S G
Sbjct: 61 SNPSILVEKLSFEVKGLEKLDIQWFSTQKPSPGSKGRRGEHIFLDSSTPSLISNQILSPG 120
Query: 121 ATKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEAR 180
A + +VR LP IPPSYKGAT+RY+YY+KSTL GRW+ EN + K+S +D IE+E R
Sbjct: 121 AKMTLMVRAILPQIIPPSYKGATLRYLYYIKSTLCGRWMALENSQFYKDSTQDFIEVETR 180
Query: 181 VPLQVWVTQKISGMLMEEGQNDGSPYRYLVIEGLGFIFSKSLANMNIAFQMDVFWKEMEG 240
+PLQVWV QK +G+L+EE Q DG I S Q +++WKEM+G
Sbjct: 181 IPLQVWVIQKNNGLLLEEDQIDG-------------IVPTS------TIQTEIYWKEMDG 240
Query: 241 DTDWVRANDIYDGIDEGYESSRDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSM 300
D++W RAND YD ++GY+SSRDEISSVSSY P + +RTFGSSLS S R SM
Sbjct: 241 DSEWTRANDAYDSGEDGYDSSRDEISSVSSY-PNKSNLNRTFGSSLSLNSGP----RLSM 300
Query: 301 KDSPFTEGERLSLSSNVAHPGLSVAEVLYDSAGSVETDGTSPQKSSAVGSQALNFEKNQS 360
KD+ + E ER+ S + LS A V YDS TD +SP KS S ++ +
Sbjct: 301 KDTSYVE-ERVGSSPKMMLSQLSAAVVSYDSG----TDVSSPHKS----SNSVVPSQQPK 360
Query: 361 KDDDAGVPSSPRPKTNEPVASEGFIRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGT 420
+ + AG SP EPV SEGF RGRSYNIR+DDQVLLRF PKN+DSTYYFSD IGGT
Sbjct: 361 QTNGAGASMSPGAGAREPVPSEGFTRGRSYNIRMDDQVLLRFSPKNADSTYYFSDTIGGT 420
Query: 421 LTFFHEEGARRCLEVSIALETSETVSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFL 480
LTFFHEEG RRCLEVS+ LET ET++RRFVHPSRRNSPT+ KVQSDH+EVVADLIQTSFL
Sbjct: 421 LTFFHEEGTRRCLEVSVTLETLETINRRFVHPSRRNSPTLTKVQSDHHEVVADLIQTSFL 480
Query: 481 FSIPMDGPMSFSTPHVSLQWALRFEFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHA 540
FSIP DGPMSFSTP VS+QW LRFEF TTPK+VD +RYEHPL++ REKSEW+LPI VHA
Sbjct: 481 FSIPTDGPMSFSTPRVSVQWILRFEFLTTPKSVDLSRYEHPLLVPEREKSEWVLPITVHA 532
Query: 541 PPSSSAAAQNRNERSFSLDPLWMHS 565
PP ++ AQNR ++ F L+P W+ S
Sbjct: 541 PPPRTSGAQNRGDKLFGLEPSWIRS 532
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023514696.1 | 0.0 | 94.68 | uncharacterized protein LOC111778917 [Cucurbita pepo subsp. pepo] | [more] |
KAG7026066.1 | 0.0 | 93.62 | hypothetical protein SDJN02_12564 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022964483.1 | 0.0 | 93.44 | uncharacterized protein LOC111464493 [Cucurbita moschata] | [more] |
XP_022999897.1 | 0.0 | 92.55 | uncharacterized protein LOC111494236 [Cucurbita maxima] | [more] |
KAG6593727.1 | 0.0 | 91.18 | hypothetical protein SDJN03_13203, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1HKY7 | 0.0 | 93.44 | uncharacterized protein LOC111464493 OS=Cucurbita moschata OX=3662 GN=LOC1114644... | [more] |
A0A6J1KC17 | 0.0 | 92.55 | uncharacterized protein LOC111494236 OS=Cucurbita maxima OX=3661 GN=LOC111494236... | [more] |
A0A1S4E0M0 | 0.0 | 82.69 | uncharacterized protein LOC103495739 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A6J1L132 | 1.20e-314 | 81.14 | uncharacterized protein LOC111499409 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1H364 | 1.76e-314 | 80.78 | uncharacterized protein LOC111460073 OS=Cucurbita moschata OX=3662 GN=LOC1114600... | [more] |
Match Name | E-value | Identity | Description | |
AT1G50120.1 | 2.3e-174 | 57.17 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |