Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAGTGGAAGTGTGGTCTGTGGAGGAATACGGACCGCAGCCATTCAAATTCGCGGCCATGGCAGTTGCAGTGAGCTACTGCACCTCTTCCTCCTTTCTTGGTTACTTCCCCTCTGTGAGTTCTTTTCTTTTCGTAGTTCGTCCTCGCTTCTCTGATTTTGATGTTCTTGTGGTTGATTACTTGAACTTTTATCGTCAGGGATCGTTTTACGTTACTGTGGATGATTAGTTAAGCTTTTATCGTCAGAGTGTTCTGAATTCCTGGAAGCATTTCTGGTTATGGATTCAAGTTACTAAAAGGAATCCTTAACATAGTTATGTAGATTTGTTCTTTTGGGGGGTGTTCTTCGGTTTCGGAAGCCTTATATGCCATTAGACTCGAGAAATTTCTGGCACGGTTTAGTTAATTCTTTATACACATTCTTAATCTGATACTCCTCTCATTGGTAGGCTTGAAAGTTATTGCATAATCTCTATTAATCGAGGAGGGAAATTTTTAGACATGTTATCAATGTTACTCTCATTAGTGGACATGGAAATTAATAAAAAATTCACTAAGTGACTATCAACATTACTTTTTGGAGGGAATGGCATTATGGATTTGAAATTCCAACATGAATTCTGACTTTGATACTATGTCAAGATCATATTAAACTTAAAAAGGTTTGAGCTCGTTGAATTATGATATATTAAATCGTTTATACATGCGTTTAATGCTGCCATTGTTCTAATCATACATTTTGTTGCTAGTCGGGTGTTCCTTAAAAACACCCATTGACTTATTTGGACTCCAAATCTCATGCCAAAGGATTAAAATAACTGTCGGTTTGACATCGACAGTAAGGATCGATTGATATCTTATAAATATTCACGAAATGTAGAAAAGTATCATCAATTAGCTATCATTATGAATATAAATACTATTAATAAGTTCCATAACTTAGAAAAAAAGAACTTAACCATTAATTGAAAGTAATGTTTAGAGCTTTTTTAGTTTACAAGACTATCTGTCTATATCTTTGTAATAGTGAAATCTCCACATGGATATCAAAATTATTATGTTTTAAATCCTTGCTATGCCTTTGTGTGTTTTAAAATGTTTTGCTTCATGTCTGGGGAAAACTAAGGGATAACGTTTGGCAATGATGCTTCCACATTTGTTCTTATAGTTCACCTTTACATTGCTTCCAACTCTGAAAATGTTGCTTGTTAACCTGGTTTGGTGCGATATAAGAAATGCCTTGCTCAATTTATGGCTGAAAAAATGCACAAAACAAGTGTGAGAGCATGATCTACTACCCCTTCTGGCTCCTTGTGTTGCTTTATCAGTTTTCAGAATACTTCCGGTGGCATGTCCTGTGATTTTATTCATCTTGTTTGTCTGTAAAGAACTTTTTCCAATCTGTAGCATTTTAACGTCAAACAGATTAAGGATATGATGCTACATGCAACTCTTATGAATGGTTTTTTTGTAGTATATAAATAGAAAAACCTCAACTACTATATAATTGCAGAAAAAATTGAAGTCCAGACACTTGGTTAGCCATTCAGCTCAGGCAACTAGAATTACTGGGCTTTTTTGGGGAGCTAAAAAGTCGACAGTACCAAAAGAATTTGATTATTCATTGGGAGATTTTACTTTGACAGGGACAGGCCCAGAGGTATACTACTTGATTTTGAAGTTTACTTGTAATTTATAATCGATTATTTAGTTATTTGTTTGTTATCACTTAGTTGTTCCTTATTCAGGGAGGCTCAGTTTCCCATTCAAAACCTACAAAGTTGTCTCTTTCAGTTGTTTCGTCTATTTCAGAGGTTTCAGCCAGTGGCTGGGATGCTTGTGCCCTGGATAGTACTGGCCCTGAAAAATATAATCCATTTCTGACCCATGGATTTCTTTCAAGCCTGGAAGAGACAGGTTGTGCTGTGAAGGTGAGGAACTAAGGCCTCTTCAGAAGGAGGGTTAAAAATGTTTTGTTCACAAATGAAACAATCATCTGAGATTCCAACTTTGAAATTTTCCTAAGCAGGTTGCTCTTTAAACACAATATTCAATAAACAAAACATAATTAAAAATTAGACAATATAATTCCATACTGTATAGCACTAGTTTATGAAAATAACTACTTATGTACATTGAATCGTTGATATTGTCTTATAAACCAATTAGTTTTACCTTATCTTGGAAAGCTGCACTGTTGTCTTACTTTTAATCGAATTTCAGGAAACTGGATGGACGCCTCGCCACATTGTGGCTAAGGATGAATCAGAAAATATTTTATGTGTTGTTCCACTCTATCTTAAAAGGTTAGCATTGACTCTTTATTCATTTATCATTTCTCGATAGATTCTCTAGTTTTATACTACATTTTTTCATTCTACCTGTTGGGAAGATCTGCTGATTTTGATCTGTGGCTGCCTTTTCAACTTCTAACTTACTTTCAATCTGCAGTCATTCCTATGGAGAATTTGTTTTTGATCATTCCTGGGCTGATGCATACTACAGTATTGGTGGAAGATATTACCCAAAACTACAATGTTGTGTGCCTTTTACTCCTGTGACTGGTCCAAGGATTTTACTCCGTAACACAATATTCAGAGATGAAATCTTTGACATTATAGTTTCTGCTTTAAAGGATATGACAGCCAAGGTACGACTAGTCATATGATCGATGCTTTTAGTGTACATTATGCACGTCTATCTATATGGAAAATTAGTTGCGATCATTCGTATGATAATATGCACTTAATCTCTCTCTCTCTCTCTCTTTCTCTCAATATTGTTTACTATTAAACCTTTGAAGATAGCATCTTATTTGTCAGATGAATGTGAACGACTGTTCGAGAACTTGATCTTTGGAGCAATTCTGTACTATATTTTGTGCATGCTTGATGTAAATAATAATCGGTCTTGGGAGTCTGTTGGTGGGGTGGTGTTTTTTTCCCTGGGGGAAGGGACAATGCAAAAATATGGAAGTGTTATTTTTACCCTTGAGGTTACGATGACCCCACTATTTGTGTCGGCTTGCATTTGAACCTTTTTGCTATTTAGTTGATGGAAGTTTTGGTTTTGGAAAAAAGCAAGGATGGACATTTTATTTTGTCTCCTTTTGGGAAAGATTATTAGCAACAAAAAATGGGGAAAAAAACTTTGGTGTCCTTATGAATTTACATATTATAGCATCATATGGTGAAATTTTTGTTTGTGTGCTCTACATTTTTATTGAAGGAACCTATAGATAAGTCACATCTTTCATATCATTTAATAAAAGCATTTTATTTGGAGTTAATTACTTCGAACAGTCTCAGCTCTCGTCACTGCATATTACCTTCTCATCTGAAAACGAATGGCAAAAACTAAGTGACGGAGGATTTCTGCAAAGGATTGGAATGCAGTACCATTGGAAGAATCGTAATTACAAAGAGTATGTCGTTTGTTTATTTGTCTTTAATTGTATGAATACACATTTGATGATAGTAGTTTGTTTTGTTATAGTTCAATTTCTCTGTCAGTGAGTTCATAGAAAGGTTGATTCACATTAACATTAACATATTCAATGGAATGACAACATCAGTATGTTTTTAAGCTAATGCTTTCGGATGCCAACTTGTGACGCTCTTTCCCTCCCCCTCCTCCATTCCATGGCCGCTAGCTCCTGCACTGATCCCTTCCCCCCTGTCAACACCATGCCCGTCTCATACACAAAACGCTCTATAGCCATAAACCATAAAATCTTCTACATTTCAATTGGTTACTCCCATAAAGGAAACAGAGCCCGGATTATGCTCCTTCTCCATATCTGTTGGACCTCCCTAACATGGCTCTTCTCCATATTTGAAACTCTTGTGGCGACCCTTGCACCTACAAACTCTTCATGGAGAAGCGAAGTGCAATTACTTCCTTTGGATTGAAGAGTTTTGTATAAAAAAAGTATACTATGCAGATGCAACGAAACGAAACAACAATGGAGGAAGATTGCCCAACACGATACCAGCTGGAATGACAAAAAAGGATGGCTTTTCTTCTTGTCCCTCATATTGGACTTCTCTGGACGGGGCCTCAGACTTGCTCCATCCTCACCAACATATTATAAAGAGAGAGAACTGAAGGGCATCACCTCCATCTAATCCTCCACGTGCCCCTTTATTTTCCTTACTTCAAGCCTCATTAACACTAATAGGCCACCCACCCACTCTCCTGCCTGAGCAACCCATCCCCTTTCTGAACTGTAGTCTTTTTTAATTGGGAAAACATAATTATTATTCAAGAGAGATCCTTGCATCACAGGTGGTACCTCATCAGAGTGGCTATTCAGGATTCCCTTTAGTCCCGCTCTCAATCCCATACAACTGGTAAAGTCCTTCATTGTGGGGATGTAAAGCCCATCAACACCCTATGTTCACCTGACAACTGGTGAAAGGTTAGGGACTTCTCTTTGCAGTTCACATGTTGGATTCTTCCCATTAACCTTTGAATGAGAGGAACTATTGTATGTATGGAGACATTGCTGTGACTTAATAGTTAATGGATGTTGCCTAAAAATCTCTGCGATGGATTTGACAGAGCTTCAAATCAAATTGAGGAGAAACCCATTGGACTTTCTTTCAGCTGAGCTTGTAATCCCTTCTTCCATCGCCAGGAAAGATCAAATTGCTGGCGTTCATGGCCCATACCAAAGCAATTCGAGGGCCCCTCTGGAAGATTCAAATTTCTCTGATGAAAAGGACAAAAAGGAAAATTGGTGATAAAGGACAACTCCTCGGACCTGGTACATCCTGCAGCCTTCCTACACTTCCATTCATTGTCCTTTTGCTCAGAGATTTTGATCATTCTTGCTCACAGCTTCAGATGGTCAGCAACCCTATCAGTCAATATTCCAGATTTTTTATCTATCATCTTCATTGGGCATCCCTCCAAGAGTGAGAAGCAAACCCTCTGGCTAAACCTTGCTAGAGCTTTTATCTGGACCAAATCTTCTTATGATACTTTTTTTTAGCACTATTCTCCTAGTGGCTCTGTCTTATTGTTATCTCGTGCCACAAGGAGGGATAAGATTTAGATTATTTATTCATCAGTTCTTCTGCTTTAGTTTGAGATGATTCACTGGGTGGTTGTTTCTTCTTTCCCTTTTTTTAATTCTTAATTCTTCGTTCTTTATTTTATTTTAATAATTAGAAAGACAACATTTCCTTGAGCAAATGAACTGAGGAAAGCAGCAAACCTACAAAAATTCAATCAATGCTTCAAGATCTCTTTTTTATCAACTGTTAGGATTGTCATTACTTATTAGGTATAAAGTAATTAGCTGGTAGGTTTGTTAAGAATGGTGGTCATTGTTTATATGGGAATTTGGGAAATTAGGGATAGACCCATTATGACATTAGGTTGTACTTCGTCATAGCTTTCTGTTAATTGTAGACTATTAAGTGGTATGTTTGCAAGCATGTTTGCTTTTGTGAAGGTTGTGGATTGAACTTTAAGAGTATCCTTATAAATCTTTTCTGACTTAGTTAGTTGTTTCTTGTTTCTTTTAACTTAGTTGTTTTTCTGAAATCTCATTTGAATTAAATACCAAGTTTGGAGGCATCAATTGTTTGTCCATTTGAAGGTGAAAGTTGCCAACTTTGTTGTGACATTTGATGTGCAATGGTCAACCACTAACATTTTTTTATTTTTATTTATTTTTATTTTATATATATGTTTCCCAAGTGCAATAGATACTGAACATTTTCCCAGTATCATTTATAATTTCTTCTCCCTTCTATCTTCTGCAGCTTCAATGAGTTTTTAATGGATATGAAGCAAAGTAAGAGGAAAAATATTCGTCAGGAGCGCAAAAAGGTCAGCACTTTTAATTATCATTCCATTGTGTGGGATACCAAAATATCAAATCCACAGCAGAAGTTGATGGCTTTGGAGTCTCTATGATGATTTGAGTTAGACTAGAAATATAATTTTATTACTGGTTTAATCTGCCTGCTTTAGAGTGCAGTCTTGTTGAATGACCCTTTGTAACTAGGAGACCCTAAATGACGCTAGAATAGCTGCAATACCCTGTACTAACAAACTATTATGAATTAAGAGAGAGCTCAACGTCATTCGATTTGCTGAAGTCATAAATTGGATATGCATAGTTGCCCTACCAATGGGCTTTTATGGCTAAGATCTTGCATTAATGTTTATAGATGGTTGTGTTACTGATTATTACTTGTAGATTTGAATAGTATTGTAATTTCCCCAGCTTTTAGTGGCTATGAATTTGTAATTTATCTTTGTGGACTTTCTGTGGCAGATTGTGGCTCAAAATTTGACCATGAAACGTCTCCGGGGTAGTGAAATAAAGGTAAAAACTGGATCTGCTTGTATTTTATTAAAATTGGATAGGATTAAATAAATTTGATCTTTTAAAAAAGTTATCCTTATACTCGAGTGTCAATGAGTTCGCATAATAAAGGACAACGATAATTAATGATAACGAACCATCTCTTGTAAAATCTAGCTTATGAATTGTTGTTTGTGGCAACCATTACTTATCTTACTGCAAGAGAATTCGGGGCACGGGCATCATTTGTTGCAATGTTTTGGCAGGATAAGCATTGGGATTCATTCTATACATTCTACAGGAACACTACTGATAACAAGTTTGGTCCTTACTATCAACCCCTTTCATTTTTCTGTCTATTGGGATTAAGAATTTCTTTTTGTTGTTGCCATTTTTGGACCGAGTATGATGAATTGGTAGATGCATGAAAATATATTTAATTGTATTCGGTAATGGCTTCTTCATAGGTGGGGCACCCCTTATCTCACTCGGGATTTCTTTTATAATATGGCCTCTAAGATGGGAGATCAAGTGTTACTAGTTACTGCAGAGGAAGGTGATGAATTTGTTGCTGGAGCGCTCAATCTAATTGGTGGAGATACTCTATATGGGCGGCTGTGGGGTTGTCACCCTAGAGCATACTATCCAAGTTTACATTTTGAAGCTTGTTATTATCAGGTGTGTTCGTAATATCATTATTTCATTGCAGCAATATAATCGACATTTTCAAGTTGAGTGAAATAACCAAATCTATTAGTTCATTCTCTCTCTCTTTTCGTTTTGTTTGCTCTCAAGATTTTGTCTTAATAAAATATAATATATCACGTGTAACAGCCTAAGCCCACCGCTAGTAGATATTGTCCTCTTTGAGTTTTCCCTTTTGAGCTTCTCCTCAAGGTTTTAAAACACGTTTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATATTTTGTTTTCCTACCCAACTGACGTGGGATCTCACAATCCACCCCCCCTTCAGGGCCCAGTGTCCTCGCAGGCACTCGTTCCCTTCTCTAATCGATGTGAGACCCTCCAATCCACCCCCCTTTGGGGCCTAGCTCCCTTACTGGCACACCGCCTCGTGTGCACCCCCCTTCGGGCTCAGCCTCCTTGTTGGCACATCGCCTAGTGTCTGGCTCTAATATCATTTGTAACAGCCCAAGCTCACTGCTAGCAAATATTGTCCTCTTTGGACTTTCCTTTTCGGGCTTCCTCTTAAGGTTTTAAAATGCGTTTGCTAGAGAGAGGTTTTCACACCCTTATAAAGAATGTTTTGTTCGGCCAAAACATGAGTTTCTTTGCTCTAGGATCTCGCTGTTATCAGCTTATGGGACCTTGATGCTTATGGCCTTGTTCTGTATAATATTAGTTCATTTTTAAGTCGGAGAACAAAGCACTCCATCCAAATAATGTATACAAGGAAACCATCACTTCAACTTCTTAGCTGGTAGCAAACATAAAAGGAGAACATTTCCAAAAATAATTATGAATAGTACCTCCTAGAACGGTTCAAAAATGATTCTATCTCAAACCTCACCCAAGTCCTCCTACCCTTCAAAATCCTTACGTAATTATGATTACAGGCTAAATTTGACAATATCTGCTAGTGGTGGGCTTGTGCTGTTACAATTTCCATTTCAAACGTCCCCACTGAAGAGAGTTGAAACCATTACCCGTCATAAAAGTAGATTTATACTTGATAATGTCTCTGAACTTGGAGTTGCTTACTGTGAACCATCTCTTTTCTTTCTTGCGCTGAGGAACGTTGGTGCCAAAATGAGCTTTATTTATAATTGTGTTACTCTCATGCATTTTACTTAATTTCCTAAATAAAAGGTCCAAAAGTCATCAGAGTTTATAAATGCTGAAATGACCTCGAATCTTTGACCTTTGGTCGTTCCTCGCAGGCAATAGAAGCAGCTATTGAACTTGATCTCGACACAGTAGAAGCAGGAGCTCAGGGTGAGCATAAGATTCAACGAGGTTACATGCCTGTGACCACTTATAGCTGTCATTATCTTACGGATGAAGGTTTCGGGAGGGCAATAGATGATTTCCTAATGCGTGAAGCAAATCAGGTAACTCTATATATTCTGTGACTGATTGTGGCATAACGGTGTAATTCTTCTTCCAATGATTTATTGTTTAACTATAGACAGGAAGTTGCATCGTTTGCAAAAATCTAACTTGTGAGATCCCACGTCGGTTAGGAAAGAAAATAAAACATTTTTGAAAGGGTGTGGAAACCTCTCCCTAGCAGACACGTTTTAAAAACCTTGAGGGGAAGCCCGAAAGGGAAAGCTCAAAGAAGACAAAATCTACTAGCGGTGGGCTTGGGCGGGTTACATAACTTAGGGATATAACTTAGGGATAGTTACATGGAAAATAGAAAGAACTTTAAGAACATAGTTTGTTAAGTCCGTTTGATAACGTTTTTTCTCCAAATTTCTCAACTACAGTGAACATCGGAATTCCTAGTCAAACTTTATTTTAGTTTTCGTAACTTGTCTACAGACCGAAAGTTCTTATGGAGAGTTCGTTTTGTATATCCTAATTAATTGGTTCATTTAACCTTCAATCCATGGTTTGATTCAGGTTCTTCACATACCATTATCTACATGCTTCTGAATTGTTGGTTGATACAAAATGACTACTTTTGTTTGCAGGTTAAGGCTGTGATAAAACTATTGCATGATTCTGGACCCTTTAAAGAAGGTATATAAATTTAGGCCTTCCAAACACTGGAAGCAATAGGATCATCACATTTTATATAGATGATTATCAGATGATATGCTTCCTGATTGAATTTAGTGAAGGGTTAAAATGACTGTACAGAGACCAAGTTATGGCAACATAGCTATTTAAATGGAAAAGTTTTCAGCTATTTGCATGTTTTTCAGCTACGGCTGCATATTTTTTTG
mRNA sequence
CAGTGGAAGTGTGGTCTGTGGAGGAATACGGACCGCAGCCATTCAAATTCGCGGCCATGGCAGTTGCAGTGAGCTACTGCACCTCTTCCTCCTTTCTTGGTTACTTCCCCTCTAAAAAATTGAAGTCCAGACACTTGGTTAGCCATTCAGCTCAGGCAACTAGAATTACTGGGCTTTTTTGGGGAGCTAAAAAGTCGACAGTACCAAAAGAATTTGATTATTCATTGGGAGATTTTACTTTGACAGGGACAGGCCCAGAGGGAGGCTCAGTTTCCCATTCAAAACCTACAAAGTTGTCTCTTTCAGTTGTTTCGTCTATTTCAGAGGTTTCAGCCAGTGGCTGGGATGCTTGTGCCCTGGATAGTACTGGCCCTGAAAAATATAATCCATTTCTGACCCATGGATTTCTTTCAAGCCTGGAAGAGACAGGTTGTGCTGTGAAGGAAACTGGATGGACGCCTCGCCACATTGTGGCTAAGGATGAATCAGAAAATATTTTATGTGTTGTTCCACTCTATCTTAAAAGTCATTCCTATGGAGAATTTGTTTTTGATCATTCCTGGGCTGATGCATACTACAGTATTGGTGGAAGATATTACCCAAAACTACAATGTTGTGTGCCTTTTACTCCTGTGACTGGTCCAAGGATTTTACTCCGTAACACAATATTCAGAGATGAAATCTTTGACATTATAGTTTCTGCTTTAAAGGATATGACAGCCAAGTCTCAGCTCTCGTCACTGCATATTACCTTCTCATCTGAAAACGAATGGCAAAAACTAAGTGACGGAGGATTTCTGCAAAGGATTGGAATGCAGTACCATTGGAAGAATCGTAATTACAAAGACTTCAATGAGTTTTTAATGGATATGAAGCAAAGTAAGAGGAAAAATATTCGTCAGGAGCGCAAAAAGATTGTGGCTCAAAATTTGACCATGAAACGTCTCCGGGGTAGTGAAATAAAGGATAAGCATTGGGATTCATTCTATACATTCTACAGGAACACTACTGATAACAAGTGGGGCACCCCTTATCTCACTCGGGATTTCTTTTATAATATGGCCTCTAAGATGGGAGATCAAGTGTTACTAGTTACTGCAGAGGAAGGTGATGAATTTGTTGCTGGAGCGCTCAATCTAATTGGTGGAGATACTCTATATGGGCGGCTGTGGGGTTGTCACCCTAGAGCATACTATCCAAGTTTACATTTTGAAGCTTGTTATTATCAGGCAATAGAAGCAGCTATTGAACTTGATCTCGACACAGTAGAAGCAGGAGCTCAGGGTGAGCATAAGATTCAACGAGGTTACATGCCTGTGACCACTTATAGCTGTCATTATCTTACGGATGAAGGTTTCGGGAGGGCAATAGATGATTTCCTAATGCGTGAAGCAAATCAGGTTAAGGCTGTGATAAAACTATTGCATGATTCTGGACCCTTTAAAGAAGGTATATAAATTTAGGCCTTCCAAACACTGGAAGCAATAGGATCATCACATTTTATATAGATGATTATCAGATGATATGCTTCCTGATTGAATTTAGTGAAGGGTTAAAATGACTGTACAGAGACCAAGTTATGGCAACATAGCTATTTAAATGGAAAAGTTTTCAGCTATTTGCATGTTTTTCAGCTACGGCTGCATATTTTTTTG
Coding sequence (CDS)
ATGGCAGTTGCAGTGAGCTACTGCACCTCTTCCTCCTTTCTTGGTTACTTCCCCTCTAAAAAATTGAAGTCCAGACACTTGGTTAGCCATTCAGCTCAGGCAACTAGAATTACTGGGCTTTTTTGGGGAGCTAAAAAGTCGACAGTACCAAAAGAATTTGATTATTCATTGGGAGATTTTACTTTGACAGGGACAGGCCCAGAGGGAGGCTCAGTTTCCCATTCAAAACCTACAAAGTTGTCTCTTTCAGTTGTTTCGTCTATTTCAGAGGTTTCAGCCAGTGGCTGGGATGCTTGTGCCCTGGATAGTACTGGCCCTGAAAAATATAATCCATTTCTGACCCATGGATTTCTTTCAAGCCTGGAAGAGACAGGTTGTGCTGTGAAGGAAACTGGATGGACGCCTCGCCACATTGTGGCTAAGGATGAATCAGAAAATATTTTATGTGTTGTTCCACTCTATCTTAAAAGTCATTCCTATGGAGAATTTGTTTTTGATCATTCCTGGGCTGATGCATACTACAGTATTGGTGGAAGATATTACCCAAAACTACAATGTTGTGTGCCTTTTACTCCTGTGACTGGTCCAAGGATTTTACTCCGTAACACAATATTCAGAGATGAAATCTTTGACATTATAGTTTCTGCTTTAAAGGATATGACAGCCAAGTCTCAGCTCTCGTCACTGCATATTACCTTCTCATCTGAAAACGAATGGCAAAAACTAAGTGACGGAGGATTTCTGCAAAGGATTGGAATGCAGTACCATTGGAAGAATCGTAATTACAAAGACTTCAATGAGTTTTTAATGGATATGAAGCAAAGTAAGAGGAAAAATATTCGTCAGGAGCGCAAAAAGATTGTGGCTCAAAATTTGACCATGAAACGTCTCCGGGGTAGTGAAATAAAGGATAAGCATTGGGATTCATTCTATACATTCTACAGGAACACTACTGATAACAAGTGGGGCACCCCTTATCTCACTCGGGATTTCTTTTATAATATGGCCTCTAAGATGGGAGATCAAGTGTTACTAGTTACTGCAGAGGAAGGTGATGAATTTGTTGCTGGAGCGCTCAATCTAATTGGTGGAGATACTCTATATGGGCGGCTGTGGGGTTGTCACCCTAGAGCATACTATCCAAGTTTACATTTTGAAGCTTGTTATTATCAGGCAATAGAAGCAGCTATTGAACTTGATCTCGACACAGTAGAAGCAGGAGCTCAGGGTGAGCATAAGATTCAACGAGGTTACATGCCTGTGACCACTTATAGCTGTCATTATCTTACGGATGAAGGTTTCGGGAGGGCAATAGATGATTTCCTAATGCGTGAAGCAAATCAGGTTAAGGCTGTGATAAAACTATTGCATGATTCTGGACCCTTTAAAGAAGGTATATAA
Protein sequence
MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDFTLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSSLEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRYYPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQKLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGSEIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALNLIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMPVTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI
Homology
BLAST of CmoCh16G008230 vs. ExPASy TrEMBL
Match:
A0A6J1ENA9 (uncharacterized protein LOC111436048 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111436048 PE=4 SV=1)
HSP 1 Score: 962.2 bits (2486), Expect = 7.8e-277
Identity = 465/466 (99.79%), Postives = 466/466 (100.00%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEG+
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGM 466
BLAST of CmoCh16G008230 vs. ExPASy TrEMBL
Match:
A0A6J1EUJ6 (uncharacterized protein LOC111436048 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436048 PE=4 SV=1)
HSP 1 Score: 960.7 bits (2482), Expect = 2.3e-276
Identity = 464/466 (99.57%), Postives = 465/466 (99.79%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKE +
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEAV 466
BLAST of CmoCh16G008230 vs. ExPASy TrEMBL
Match:
A0A6J1J3K4 (uncharacterized protein LOC111483077 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111483077 PE=4 SV=1)
HSP 1 Score: 949.1 bits (2452), Expect = 6.8e-273
Identity = 460/466 (98.71%), Postives = 463/466 (99.36%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYC SSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCNSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSAS WDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASDWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDI+VSALKDMTAKSQLSSLHITFSSENE Q
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDILVSALKDMTAKSQLSSLHITFSSENESQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDF+EFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFDEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFL+REANQVKAVIKLLHDSGPFKEGI
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLVREANQVKAVIKLLHDSGPFKEGI 466
BLAST of CmoCh16G008230 vs. ExPASy TrEMBL
Match:
A0A6J1J6S9 (uncharacterized protein LOC111483077 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483077 PE=4 SV=1)
HSP 1 Score: 948.0 bits (2449), Expect = 1.5e-272
Identity = 459/466 (98.50%), Postives = 463/466 (99.36%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYC SSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCNSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSAS WDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASDWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDI+VSALKDMTAKSQLSSLHITFSSENE Q
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDILVSALKDMTAKSQLSSLHITFSSENESQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDF+EFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFDEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFL+REANQVKAVIKLLHDSGPFKEG+
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLVREANQVKAVIKLLHDSGPFKEGM 466
BLAST of CmoCh16G008230 vs. ExPASy TrEMBL
Match:
A0A6J1JA14 (uncharacterized protein LOC111483077 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483077 PE=4 SV=1)
HSP 1 Score: 944.9 bits (2441), Expect = 1.3e-271
Identity = 458/464 (98.71%), Postives = 461/464 (99.35%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYC SSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCNSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSAS WDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASDWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDI+VSALKDMTAKSQLSSLHITFSSENE Q
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDILVSALKDMTAKSQLSSLHITFSSENESQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDF+EFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFDEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKE 465
VTTYSCHYLTDEGFGRAIDDFL+REANQVKAVIKLLHDSGPFKE
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLVREANQVKAVIKLLHDSGPFKE 464
BLAST of CmoCh16G008230 vs. NCBI nr
Match:
XP_022929496.1 (uncharacterized protein LOC111436048 isoform X2 [Cucurbita moschata])
HSP 1 Score: 962.2 bits (2486), Expect = 1.6e-276
Identity = 465/466 (99.79%), Postives = 466/466 (100.00%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEG+
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGM 466
BLAST of CmoCh16G008230 vs. NCBI nr
Match:
XP_022929495.1 (uncharacterized protein LOC111436048 isoform X1 [Cucurbita moschata])
HSP 1 Score: 960.7 bits (2482), Expect = 4.7e-276
Identity = 464/466 (99.57%), Postives = 465/466 (99.79%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKE +
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEAV 466
BLAST of CmoCh16G008230 vs. NCBI nr
Match:
KAG6577356.1 (hypothetical protein SDJN03_24930, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 954.9 bits (2467), Expect = 2.6e-274
Identity = 464/466 (99.57%), Postives = 464/466 (99.57%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAK LSSLHITFSSENEWQ
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAK--LSSLHITFSSENEWQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 464
BLAST of CmoCh16G008230 vs. NCBI nr
Match:
XP_023552005.1 (uncharacterized protein LOC111809806 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 954.5 bits (2466), Expect = 3.3e-274
Identity = 461/466 (98.93%), Postives = 464/466 (99.57%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYC SSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCNSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLY+KSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYVKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSE EWQ
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSEKEWQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDF+EFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFDEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFL+REANQVKAVIKLLHDSGPFKEGI
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLVREANQVKAVIKLLHDSGPFKEGI 466
BLAST of CmoCh16G008230 vs. NCBI nr
Match:
XP_022984967.1 (uncharacterized protein LOC111483077 isoform X3 [Cucurbita maxima])
HSP 1 Score: 949.1 bits (2452), Expect = 1.4e-272
Identity = 460/466 (98.71%), Postives = 463/466 (99.36%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
MAVAVSYC SSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF
Sbjct: 1 MAVAVSYCNSSSFLGYFPSKKLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLGDF 60
Query: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFLSS 120
TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSAS WDACALDSTGPEKYNPFLTHGFLSS
Sbjct: 61 TLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASDWDACALDSTGPEKYNPFLTHGFLSS 120
Query: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY
Sbjct: 121 LEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGGRY 180
Query: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENEWQ 240
YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDI+VSALKDMTAKSQLSSLHITFSSENE Q
Sbjct: 181 YPKLQCCVPFTPVTGPRILLRNTIFRDEIFDILVSALKDMTAKSQLSSLHITFSSENESQ 240
Query: 241 KLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
KLSDGGFLQRIGMQYHWKNRNYKDF+EFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS
Sbjct: 241 KLSDGGFLQRIGMQYHWKNRNYKDFDEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLRGS 300
Query: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN
Sbjct: 301 EIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGALN 360
Query: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP
Sbjct: 361 LIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGYMP 420
Query: 421 VTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
VTTYSCHYLTDEGFGRAIDDFL+REANQVKAVIKLLHDSGPFKEGI
Sbjct: 421 VTTYSCHYLTDEGFGRAIDDFLVREANQVKAVIKLLHDSGPFKEGI 466
BLAST of CmoCh16G008230 vs. TAIR 10
Match:
AT2G23390.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF482 (InterPro:IPR007434), Acyl-CoA N-acyltransferase (InterPro:IPR016181); Has 2165 Blast hits to 2163 proteins in 543 species: Archae - 0; Bacteria - 1044; Metazoa - 0; Fungi - 0; Plants - 33; Viruses - 0; Other Eukaryotes - 1088 (source: NCBI BLink). )
HSP 1 Score: 669.8 bits (1727), Expect = 1.5e-192
Identity = 324/468 (69.23%), Postives = 382/468 (81.62%), Query Frame = 0
Query: 1 MAVAVSYCTSSSFLGYFPSK--KLKSRHLVSHSAQATRITGLFWGAKKSTVPKEFDYSLG 60
MAV +SYC S L F + L + V S +++ +T +FW + K KEFD SL
Sbjct: 1 MAVVLSYCKPSPPLRRFHRRCGNLWKKENVRSSRRSSSVTAMFWKSNKPAEVKEFDISLR 60
Query: 61 DFTLTGTGPEGGSVSHSKPTKLSLSVVSSISEVSASGWDACALDSTGPEKYNPFLTHGFL 120
D+TLT + E + K +SLSVVSSI E+ + WDACALDS+ PE YNPFL++GFL
Sbjct: 61 DYTLTESNIEEALENKPKQKVISLSVVSSIFEIPQAEWDACALDSSQPESYNPFLSYGFL 120
Query: 121 SSLEETGCAVKETGWTPRHIVAKDESENILCVVPLYLKSHSYGEFVFDHSWADAYYSIGG 180
SSLE+TGCAV+ETGW P HIVAKDE E+IL VVPLYLKSHSYGEFVFDHSWADAY S GG
Sbjct: 121 SSLEDTGCAVRETGWMPLHIVAKDECESILGVVPLYLKSHSYGEFVFDHSWADAYRSFGG 180
Query: 181 RYYPKLQCCVPFTPVTGPRILLRNTIFRDEIFDIIVSALKDMTAKSQLSSLHITFSSENE 240
RYYPKLQCCVPFTPVTGPRIL+R+ ++++FD IVSA+ ++ +K Q+SSLHITF S E
Sbjct: 181 RYYPKLQCCVPFTPVTGPRILIRDNPCKEQVFDAIVSAMTELASKLQVSSLHITFPSGAE 240
Query: 241 WQKLSDGGFLQRIGMQYHWKNRNYKDFNEFLMDMKQSKRKNIRQERKKIVAQNLTMKRLR 300
W KL + GF QRIGMQYHWKNR+YK+F+EFLMDMKQSKRKNIRQERKKI QNL M+RL+
Sbjct: 241 WDKLKEKGFSQRIGMQYHWKNRDYKNFDEFLMDMKQSKRKNIRQERKKIGTQNLKMRRLQ 300
Query: 301 GSEIKDKHWDSFYTFYRNTTDNKWGTPYLTRDFFYNMASKMGDQVLLVTAEEGDEFVAGA 360
G +IK +HWDSFY FYRNTTDNKWGTPYLTRDFF++MASK+GD+VLLV AEE +E VAGA
Sbjct: 301 GDDIKARHWDSFYDFYRNTTDNKWGTPYLTRDFFHDMASKLGDKVLLVLAEENEEPVAGA 360
Query: 361 LNLIGGDTLYGRLWGCHPRAYYPSLHFEACYYQAIEAAIELDLDTVEAGAQGEHKIQRGY 420
LNLIGGDTL+GRLWGC P +YYPSLHFEACYYQAIEAAIEL+L TVEAGAQGEHKIQRGY
Sbjct: 361 LNLIGGDTLFGRLWGCRPDSYYPSLHFEACYYQAIEAAIELNLKTVEAGAQGEHKIQRGY 420
Query: 421 MPVTTYSCHYLTDEGFGRAIDDFLMREANQVKAVIKLLHDSGPFKEGI 467
+PV TYSCHY+ DEGF +AID+FL+RE+NQ+ VI+L+H+ GPFKE I
Sbjct: 421 LPVKTYSCHYIFDEGFRQAIDEFLVRESNQMDYVIRLMHEDGPFKEKI 468
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1ENA9 | 7.8e-277 | 99.79 | uncharacterized protein LOC111436048 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1EUJ6 | 2.3e-276 | 99.57 | uncharacterized protein LOC111436048 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1J3K4 | 6.8e-273 | 98.71 | uncharacterized protein LOC111483077 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1J6S9 | 1.5e-272 | 98.50 | uncharacterized protein LOC111483077 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1JA14 | 1.3e-271 | 98.71 | uncharacterized protein LOC111483077 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
XP_022929496.1 | 1.6e-276 | 99.79 | uncharacterized protein LOC111436048 isoform X2 [Cucurbita moschata] | [more] |
XP_022929495.1 | 4.7e-276 | 99.57 | uncharacterized protein LOC111436048 isoform X1 [Cucurbita moschata] | [more] |
KAG6577356.1 | 2.6e-274 | 99.57 | hypothetical protein SDJN03_24930, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023552005.1 | 3.3e-274 | 98.93 | uncharacterized protein LOC111809806 [Cucurbita pepo subsp. pepo] | [more] |
XP_022984967.1 | 1.4e-272 | 98.71 | uncharacterized protein LOC111483077 isoform X3 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
AT2G23390.1 | 1.5e-192 | 69.23 | CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF482 (InterPro:IPR0074... | [more] |