Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCCACTTCGCGATTCCTTCTCGTTCTCCATCTTCAGCAAATTAACCACTCGGAGGCGTCGATTTCAAATGATGCTCCACGAATCCCTACTTCGCCTGCTACAATTTCTCCCTCCTCTAATGCTAGGGCTTCGGCGATGTCTGCATCCAGGTACGCATAGCGACATTTATGCTTGGATTTGTATGCATAGCGACAGACGATTTGATCTCGTATTTTTACATGGATTGGAATTCCTGTTTATTTATTTATTTCTTAGTACCTCGTGGTTGCCTCTGCACTTCTGTGTAATTTTCCAAATAATTTTCATAGTTCTTCAAATCTGTAGTTTAGTTTGGCTAATCGGAATAGTAGACAGGAGATTCGTTTCAGGCCTTTTTTACTCACTAGGAGTAGGTTTATTCCATTGAAGAGGAATATGTCATATTTTTTAATCTCCTTCTTTAAATCATTGGTATAGCTACAGAATCCTCTTCCAGATTCGTTTTGAAAGGAGCTTTCATTCCGGGAGAAATTATTAAAAAAAAAAAAATTAGAGTTTTGTTTACAGTTTTCTTTTTATTTCGAAAAACATAATCATTTTATTCTAGTATTATAAGAAATTATAAAAACTCAGATAAAGAAGAAGAATAGAAGCCTAATAAAGTTAAAATGATGTAATTTGAAGTATTAATTCTTTGAAAAAAAAAATAATAAAAAATAAAAATTGAAAAAGTAAAAAAGCAAAAACGAAATAAGAATGTGCTGCAATTTTATCTGACATGTGAAACTGAAACAAGTACATGTAATCTTGAATTTGTCAAGACTTTCTTCAAACTTTCGTTTGGAATACTAAAAAAAAATACAAAAAATCACTCCATATATCAGAGCCAGGTTTCGATCCTGGGACCTGTGGTTACGGCCCACCACGCTTCCGCCACTCTGATTTGAATATTATATAATTAATTTTTTATGTATTAATAAATTAAAAAAAAATGAGAGAGAAATAATTGATGGTCTCGATCTATGATCATAGTTTTCAGGATTTCATTTCAAATCTGTTCTCCTGCTTCCTTTTCAGTCGTTCAGTTTGTTGTATTGGCCTTCTTCTCTTTCTCATTTGTGACTTGTGACTCCATTGCTGTGTTTGTGCTATGTTCTATCATTGTTACTTTTTTATACTTTCAGATGTTCGTGTTATAGCTGGTGATAGGAACCAAAACAAAACAATTTAATTTGTATATGTATCAGTGTTAAGGTATTTATAAAAGAATAAATTTATCATAATCCATCCGCCTAAGCTTTTGAGTTTATTCTACTAAAAAACTTTTGGATTTATTGGTGATTTAACATAGTGCCAGAGAGTCCAATAATTTCATTTCCTTCCCATTTCATATTTATTACCACTTGTTGGTCCTTGTGCAAATTTCTAAGCCGGCCACTAATGAAACTAATGAAGGAAGTGTTAAGGTATTTATAAAAGATTAAATTTATTGATTGATGATTTAACAATAAACACATATATTTGAGCTTGCAAGCTCCTCTGCCTAAAACCCCTCCCAACTTACTCTATTTGTAACCACTATATTAACAAACTCCTTAATTAATGGTTAATATGCTGATATAATATCCTAATAATATTTCTAACGTTTTCCTAGAAGCTGCTTGCTTGTTCCTCTACTCGTTAAGTTAATTGCAGTTTTATGGTGTGTGTGTTAGCAGGCCAGAAGAGTGGCAGAGTTACTATCCTGAATTGCATTAGAAGAACAGTCTGTCTTCATAGTCCTCTGCCTTATATACATTTCTATTGTACCATTGAAGAGATTAAGTCTCCTTAGACTTCCCAAACTTGTACATTTATGGAAGACAAACTTGCAAGCAACCCATCATTCCTCAACTTAGAAGTCCTCAAGGTAGAAGAATGCGGTCGATTAAATCTTTTATTTCCGTCGTCATTCTCCCTCCACAACTTGAAGTCTATCAGAATAGTTCCCTGTCATGGATTGGTTCACTTGATGAATTCATAAGTAGCTAAAACCCTGGTGAAACTGCAAGAGATGCGTTTATCTTCTTGCAAAAAGATTCTACCATAATTGCAAAAGGAGAAGGAGGTGAAGAAGAGGATGAAATCGTTTTCAGCCAATTGAAGATTTTGGAGCTCTTCAACTTACCCAATTTATCAACCTTTCATTTTGGGAAAAGCAGTTTTAAGTTGCCGTGCCTGGGAAAAGTGGTTGCGAAGAAATGCCCTGAAATGAAAGCATCTTCTGGTGGAGTTGTGAGCACGCCTGAACATTACTGGTATGTCAAATGTGGATCAGACGAGGGATTTTGGACAAGCAACGTCAATGCCACCATCAACCGGCTATGGTATGATGGCCATGACACTAGCCTTCAGAGTTTGTTTATTGAACAGGTATGTGTCTATTGATTCTACAAATTTTTATTGCATGATTTTACTTCTATGCATAAATCATTTGTATGATTTTACTTAGGGGGGTGAGAAACTGGTTTGGTCGGGTGGGATCGGTTTTCATGTAAAAGGAGAAAACCGACTCGGTTTGATTTAGTGGTGAAGAAACCGAACTGATTCAGGTCGCTTTATGGCTGGTTTTTCTTTTACTTATGTTGAATTGAATTGGAAAACTAAGTGGTATCACGGGATATAGTCCTAATAGATATTTTTTTTTTCAAATGTACAAAAATCAACTTTTAATATCAGAGCCAAGTTTCGATCCTGGGACCTGTGGGTTATGGGCCCACCACGCTTCCGCTGCGCCACTCTGATCTGAATATTACATTTTTATTATTTTACTTTGTCTAACAAACCTAAACAAAATTAAAAAAAAAAAGAAATAATTTATGGTCTGGATCTATGATCACAGGTTATAGGATTTCAATTTAGATCTGTTCTGCGTTCCTTTTCAGTCGTCCAGCTTGTTCTATTGGCCTTCTTCTCTTCCTCATTTTTGACTTGTGACTCCATTGCTGCGTTTGTGCTACTATCCTTCTTACTTCTTATGGTGCATTGGCTGCGTGTTCAAACTGATTTTTCTTTTTTTGTTTTCTTTACTCAAGCAAACGATCTTTTTATAGTACTTTCATCTGTGCATGTTCATGTTATAGTTGGTAATAGGAACAAAACAAAACAATTTTATTTTGTAATTTGTATATATATCTATGTTAAGGAATCTATAGAAGATTAAATTATCCGTTAACTTAAGATTTTGGTTTGATTAGTAATTTAACATAATATTAGAATAAGATGTCCTATGGTATAATCTCTGGTTGTAATGTCACATTTTTCCTTCATTTAATCTTAATTAATTTAACATTAAACCTACATATATGTACCAGAGTTAGCAATGCGTACAAATTTTTTAGTTTTATTTTAAAGTATAAAAACTATAAAAATGAATTAGCTTTTTTTTTTTTACTAGAAAAATGAATAAGTTAGTTAGAACTTATAATATAAGTTAATTTCATAAAAAATCAAGAGGTCAATGTCATAATTAAATTAAAATTAATTATCAGAATAATACCAATTCAAAAAGCACATCTATTTGATAATAATTTTTCATTTTTCTAATAATCAAAATATCTAATTGTGAAATGGTTCCATCCTCCCAAAAACAAGTTTCGGAGTCGTGGAACATTTTAAAAACTTGGAACAGAACGGGTAACCTGAAGGGATAGTTTTTCAACGGCTCATTATGGATGTGCCTTTAACCCACAAATTGTTGCGTATTCAAAAGAATGAGGCCTATGCCTAGCCCTATGTCACAAGGATTTTGTTTCAAAAGGAAATGCCACAGCCACTTATCTAACATAGAGACCTTATTACGAAGTTTTGTTGGGTTTGTAAGGTGTGCAAACAAAGTTCCACATTGGTCCGGGAGTGATTCATGGTATATAAGGAAGAACAATTATATCTATTGGTATGAGACGTTTCAGGTGAAACCAAAAGCAAAGTCGTGAGGGCAAAGTTTTAGGTCTTTCATAAATAACAAAACAATTCCAATAATTTTGTAGTAAGAACAAAGAAAAAGTTTTAGTACGAAGAGTTAAATATAATATTGATGTAAATTGAGATATGCTTAATTACGTTGTTACTTTTGTATGCTCATAATGATTATAAATCCTAAGATAGAAAAGTTAGGCTCATTGTTCATTTGATTGACTTGATTAACAGAAAAATGTTACTAAATCTTAATCACGTTTATTATAATAATTATTATTATTAACGAGAGTGTTTTTTTTTCAGGATTTTCAACCAACCATTCAACTACTCTTTCATCAAAATTTTTATTGTTATAATCCAAGCAGCGTATAAGAATGTACAACAAACGATTGAGATAATTGATCACCCGCGAAAAAGTCGACAATGAAAAGCAAAAATGAAAGGAGAGACTAGAGAGAAGATAAATATCATTACGTTGGCTTTGCAAAGGGAAATCAAGTGAAATAAGATACCAACATGAGTATACTTCCCGAGGCACCTTACTCTTTCTTTCGGGATCTCGATCATCTATCTCTACATTTGTTGTATTAAAAAAGTAAAATAAAAGATGAGATTCTTTGAATTAGTAGCAATGTTGATAGGAGGACCATTGGTCTTAAAACAAAAAGAAGAAAAAAAAATCAAGTTTTTTATTTATTGTTTCCATTTTTTTTCCTCAAGATAAAAAAAATTATACTTATAAAAAAATATTGAAAACATTACTTCTCCAACGATAGTTAATACCGGTAAGATAATTATTACCATTAGAGCATGTTGAAAAGTACTTTTGAAGTGCTTCAAAAATTATGTGTAGATTTTCTAGCAGTTTGAATTGTAGAATTATTTTAAGATTTGTTTTATGAAATCTGAAGCCATGCACTGCCAAAACTTATGCATCTGTGGAAGGGAAGGTCACATAAGAAAGCCCCATTGCCTCAAAGTTTGGAATAATTTTATAGAGACAATTGTTTTGGACTTGGACAGGCAAATTAATAATTACTTGGATTTAGAACTGAAGACTAGATATGAACTGTTAGTGTCAGAATTTGAATTGGTGGAGCAAAGTTGATATAAGAAAAGCAACATGATAGAATATATTGTCTTTATGATAAATAAGTCACTTTGGTGGAAATAAGAGATGAGGTTCTTTGAATGGGTGGGGCAAATTAGTTGATAAGAAAGCAAAATATGACGAATATTTGGCATCATGAAATATGCCATTTTATGCCAAAACAATTGTTTTCGACAGAGTTGAATTTAGAAAAGTTCGACATTGCTATCGGTATTGAATCTAGGATGGTTATGCACAAATTAGGGTTGTCTAGAACGTTGTATCTGAAGATGGAATCAGGAAGTTGTTTGGATGATTGGATAGAAATACTGCTAAAGAGGTGTGAAGAGCTGCAATTAGTAGGATCAATTGGTGCAAGAGTTCTAACCTTTGAGTTAGTTGAAAATGAGTGTTCACATTTGAAGCATCTCCACCTTTTTAAAAATCGAGAATTTCAACATTTAATCCACCAACAGAACAAGCCTTTACGAAAAACTTTATCCAATTTGGAGGACCTACAACTTCGCTGTTTGGAGAATTTGGAAAGTATAATTGATGGGCATGGGCATGTCACAGAACTTGCTTTCAACAAGTTGAGGAGTGTAGATGTGGAGTATTGTCATAAAATGGGAACTCTGTTTTACAACTGCATGGTGGATGACATTTTGAATCTTGAAGACATTTTTATTTATGGGTGTGAGATGTTGGAATATTTGATCACTGTGACGATGGAAAGCAAAGAGACAACCGCCACTATTGAGTTTCCGCACTTGAAATCTTTAGAGCTACGTTTCGTACAACGGGTTCGAGGTTTCTGCTCCAAAATGGATCAAATTGGCAATGAGAGTTCATTCTTCAGTGAAGAGGTAAATTTCAACGTCATTTTCACAATACACCTAACTTTTTCTTCTTACTGTCGTTTCTTCTCTTATAATGGCTCTGCATTTTTTTTTTTTACTTCCTTTTAAAATTTATAGTTTAAATTTTAGTTAAATTAAAATTTTGGTACCTATAATTTTTATAAATTTCAATTTGGTTCCTATTGTTATAAAAATTGTAATTTTTGTTTTGGGTTAGTTTCTACGAGATTTGGGTTTAGTTTCAATTTAGTTACAACTTCAAAAATCATCCACCCATCACCACATTGACACTTTTTTATGCAACTATCAAATAATGTTCTCTATTTGTCTAACAGTATTAGTTGGGTTCACTAAAAAAAATGTATTATTTGGTTGTATTAGAATTTAGAGTTGAGTAAAAAAAAATATAATATAATCTATATACGAATTTTTTGAAGGGGTGAAACACAATTTATAGATGATATATGAGATTTAACCAAATTGTAGGGTTAAAAAAATTAGAATTAATTCCGAACTATAAATTAAAATTTCTAAAACTATAAAGACTAAATAGAAACTATATCGACTAGTTAGAAACTTCTAAAATTATAAGGATAAAATTGAAACTATATCCCAAACCACAGGGACCAAAATTACAATTTAACCTTAAATTTCTTCGCTTACACTCTCTAACTTATTGTTGATGCAGTTAATTTTTACAATTGTAATTATGAAAATTAATGTTTTTCTTTTTATTTATCACTTATCAATTTATCCAGTAAACTACTATGGGGCTCATGTAGCCCATGTTGGATTAATCGTATTTTGTGCTAGAGCAATGAACCTATTCGAAGTGGCTCATTTCGTACCGGAGAAGTCCATGTATGAAAAATGATTAATTTTAATTACTTCCCACCTAGCTACTCTAGGTTGGGGGTTAGGTAGGTCCTGGTGGGGAAGTTATAGACACCTTTCCGTACTTTATGTCTTGAGTACTTCACTTAATTTCCTCTACAATATTGGATTTTGGCGGTATTTATCATGCACTTCTGGGACCCGAGACCCTTGAAGAATCTTTTCTATTCTTCGTTTATGTGTGGAAAGATAGAAATAAAATGACTACCATTTTAGGTATTCGCTTAATCTTGTTAGGTATAGGTGCTTTTCTTCTAGTATTCAAAGCTCTTTATTTTGGGGGCGTATATGATACCTGGGGAGATGTAAGAAAAATTACCAACTTGACCCTTAGCCCCAAGTGTTATATTTTAATTTTGTGATGGATAAATAATGTTGGGTGTTGTGATGAAACTAATAAAGATAAAAGATTTTTAGAATTTCTGGAAGGGAAACTTTTCTTTTGACGATTTTTCAATTTGTTTGAGTAATTTAAATGACTATTAAGTATATGTTCTTCTTAGCATGCTTTTATACGAGTACAATAAACAACGATGATAGTGATTGTTGTCCTGGCTAAATTGTCAAATTATATACGTAAATTGTAGGTATTGCTTCCTAATTTGGAGGATTTGATAATTACAATGGCAGATAATTTGAAGATGATATGGCAAAACGTACTTGTTCCTATTTCATTTTCCAAACTCAAAAGGGTAGAAATTGATTCATGTAATAATGTTGAAAAAGCTTTCCCTCCAAATATAACGACCATACTTACCTGCCTTCAATCCTTAACCGTTATGAATTGTAATTTATTGAAATGTATATTTGAATTGCAAGAGCCCAATACGACAGAAAAAAGTATTGTGCTCTCCAATTTGAGGTATTTGAAATTATATAATTTGCCAAGCCTAGAGTATGTATGGAGCAAGGATTCCAGTGAGCTTTCGAGTTTTGAAAATATGGAAAGCTTGTTCATTCAAGGATGTCCAAATCTTAAAAGAGCATATTCAATCAAAGTTCTTAAGCAACAAAAAGAGCTGGGAATAGATTTCAACCAATTGAAAGAGATTCTTGAGAAGGAAAAGTTGTCAGTACATATGATGGATTCAAATCAATTTCAGACTTCCCAGGTAATTCTACATCATGTTAGAATATTTTGTCAAATAATTTTTTGTCTCTCAACGCTTTAGAAACAGATTTCAAGCTAAGTTTGATAGTTTTTATTTTTCTAAATGACCAGGCTGGGACTACACAATTGCAAGATGGTCTGGAGTTGTTTCCCAAGCTTAAAACTCTTAAACTATATGGTTATTTGGACTACAACTCAACTCATCATTTGCCAATGGAAATGTTTCGAATGGTACACAATCTTGAAGAGTTTGAAGCAAGAAGGATGTTTATTAAAGAAATATTCCCAAATGAGAGATTGATGAATGTTGAAGAACAAAAGATCAATACAAGATTTGAGCCTTCTAGATTGGGTCTATACGAATTGCCCAAGCTTAAGCATTTCTGGAAGGATGAGTTCAAGAGTAGTTCATCACTTCAAAAATTGTATGATCTAATCATATCAGGATGTGGAGTATTGGATATGTTAGTGCCGTCGTCAGTATCTTTTACAAACTTGTGGAGGCTTGAGGTGAATAAATGTCATAGACTGACCCATTTGCTAAATCCTTCGGTGGCTAAAACCTTGGTGCAACTTACATGGTTGTGTTTAAAAGAATGCAAAAGGATGACGACTGTAATTGCAAGAGAAGTTGTTGAAGATCAAGGAAATGATGAAATTGTATTCCACAAATTATACATTTTAGAACTTGAGGATTTGTCCAAATTGACCAGCTTTCATTCTGGAAACTGCAACATCAGATTTCCGTGCTTGGGAAGTGTAGATATTAGGAGTTGTCCTGAAATGAAAGCTTTTTCTCCTGGAATCACAAGCACGCCTAACTTACTAGTTGGAGATATTAAGACTGAAGGTTCCTTTAGCCGGTATGGAATATCAGGTTCAAATGGCCGGCATGTAAATACTACAGAAGTTGTTGAAGATATCAATGGTATTATCCGACAAGTTTGGGAGGACGATTATGACGATGGGATTCAATATCTGTTTACGGAAAAGGTTAGTACCTCTTTGGGTATTTCCATGGATAATAAAGTGATGATCTTCAATTATGACTTCAATCAAAACTGCTGATTTTTCGTTGTTGCACTAATTAATATGTGTTGGTGTGATTTAATGCAGAATTTGGAGGAAGACCAAATATCCGATCATCCTTCTTCTCAATGA
mRNA sequence
ATGTCCACTTCGCGATTCCTTCTCGTTCTCCATCTTCAGCAAATTAACCACTCGGAGGCGTCGATTTCAAATGATGCTCCACGAATCCCTACTTCGCCTGCTACAATTTCTCCCTCCTCTAATGCTAGGGCTTCGGCGATGTCTGCATCCAGAGATGCGTTTATCTTCTTGCAAAAAGATTCTACCATAATTGCAAAAGGAGAAGGAGGTGAAGAAGAGGATGAAATCGTTTTCAGCCAATTGAAGATTTTGGAGCTCTTCAACTTACCCAATTTATCAACCTTTCATTTTGGGAAAAGCAGTTTTAAGTTGCCGTGCCTGGGAAAAGTGGTTGCGAAGAAATGCCCTGAAATGAAAGCATCTTCTGGTGGAGTTGTGAGCACGCCTGAACATTACTGGTATGTCAAATGTGGATCAGACGAGGGATTTTGGACAAGCAACGTCAATGCCACCATCAACCGGCTATGGTATGATGGCCATGACACTAGCCTTCAGAGTTTGTTTATTGAACAGGGGGGTGAGAAACTGGTTTGGTCGGGTGGGATCGGTTTTCATGTAAAAGGAGAAAACCGACTCGAGTTGAATTTAGAAAAGTTCGACATTGCTATCGGTATTGAATCTAGGATGGTTATGCACAAATTAGGGTTGTCTAGAACGTTGTATCTGAAGATGGAATCAGGAAGTTGTTTGGATGATTGGATAGAAATACTGCTAAAGAGGTGTGAAGAGCTGCAATTAGTAGGATCAATTGGTGCAAGAGTTCTAACCTTTGAGTTAGTTGAAAATGAGTGTTCACATTTGAAGCATCTCCACCTTTTTAAAAATCGAGAATTTCAACATTTAATCCACCAACAGAACAAGCCTTTACGAAAAACTTTATCCAATTTGGAGGACCTACAACTTCGCTGTTTGGAGAATTTGGAAAGTATAATTGATGGGCATGGGCATGTCACAGAACTTGCTTTCAACAAGTTGAGGAGTGTAGATGTGGAGTATTGTCATAAAATGGGAACTCTGTTTTACAACTGCATGGTGGATGACATTTTGAATCTTGAAGACATTTTTATTTATGGGTGTGAGATGTTGGAATATTTGATCACTGTGACGATGGAAAGCAAAGAGACAACCGCCACTATTGAGTTTCCGCACTTGAAATCTTTAGAGCTACGTTTCGTACAACGGGTTCGAGGTTTCTGCTCCAAAATGGATCAAATTGGCAATGAGAGTTCATTCTTCAGTGAAGAGTATGTATGGAGCAAGGATTCCAGTGAGCTTTCGAGTTTTGAAAATATGGAAAGCTTGTTCATTCAAGGATGTCCAAATCTTAAAAGAGCATATTCAATCAAAGTTCTTAAGCAACAAAAAGAGCTGGGAATAGATTTCAACCAATTGAAAGAGATTCTTGAGAAGGAAAAGTTGTCAGTACATATGATGGATTCAAATCAATTTCAGACTTCCCAGGCTGGGACTACACAATTGCAAGATGGTCTGGAGTTGTTTCCCAAGCTTAAAACTCTTAAACTATATGGTTATTTGGACTACAACTCAACTCATCATTTGCCAATGGAAATGTTTCGAATGGTACACAATCTTGAAGAGTTTGAAGCAAGAAGGATGTTTATTAAAGAAATATTCCCAAATGAGAGATTGATGAATGTTGAAGAACAAAAGATCAATACAAGATTTGAGCCTTCTAGATTGGGTCTATACGAATTGCCCAAGCTTAAGCATTTCTGGAAGGATGAGTTCAAGAGTAGTTCATCACTTCAAAAATTGTATGATCTAATCATATCAGGATGTGGAGTATTGGATATGTTAGTGCCGTCGTCAGTATCTTTTACAAACTTGTGGAGGCTTGAGGTGAATAAATGTCATAGACTGACCCATTTGCTAAATCCTTCGGTGGCTAAAACCTTGGTGCAACTTACATGGTTGTGTTTAAAAGAATGCAAAAGGATGACGACTGTAATTGCAAGAGAAGTTGTTGAAGATCAAGGAAATGATGAAATTGTATTCCACAAATTATACATTTTAGAACTTGAGGATTTGTCCAAATTGACCAGCTTTCATTCTGGAAACTGCAACATCAGATTTCCGTGCTTGGGAAGTGTAGATATTAGGAGTTGTCCTGAAATGAAAGCTTTTTCTCCTGGAATCACAAGCACGCCTAACTTACTAGTTGGAGATATTAAGACTGAAGGTTCCTTTAGCCGGTATGGAATATCAGGTTCAAATGGCCGGCATGTAAATACTACAGAAGTTGTTGAAGATATCAATGGTATTATCCGACAAGTTTGGGAGGACGATTATGACGATGGGATTCAATATCTGTTTACGGAAAAGAATTTGGAGGAAGACCAAATATCCGATCATCCTTCTTCTCAATGA
Coding sequence (CDS)
ATGTCCACTTCGCGATTCCTTCTCGTTCTCCATCTTCAGCAAATTAACCACTCGGAGGCGTCGATTTCAAATGATGCTCCACGAATCCCTACTTCGCCTGCTACAATTTCTCCCTCCTCTAATGCTAGGGCTTCGGCGATGTCTGCATCCAGAGATGCGTTTATCTTCTTGCAAAAAGATTCTACCATAATTGCAAAAGGAGAAGGAGGTGAAGAAGAGGATGAAATCGTTTTCAGCCAATTGAAGATTTTGGAGCTCTTCAACTTACCCAATTTATCAACCTTTCATTTTGGGAAAAGCAGTTTTAAGTTGCCGTGCCTGGGAAAAGTGGTTGCGAAGAAATGCCCTGAAATGAAAGCATCTTCTGGTGGAGTTGTGAGCACGCCTGAACATTACTGGTATGTCAAATGTGGATCAGACGAGGGATTTTGGACAAGCAACGTCAATGCCACCATCAACCGGCTATGGTATGATGGCCATGACACTAGCCTTCAGAGTTTGTTTATTGAACAGGGGGGTGAGAAACTGGTTTGGTCGGGTGGGATCGGTTTTCATGTAAAAGGAGAAAACCGACTCGAGTTGAATTTAGAAAAGTTCGACATTGCTATCGGTATTGAATCTAGGATGGTTATGCACAAATTAGGGTTGTCTAGAACGTTGTATCTGAAGATGGAATCAGGAAGTTGTTTGGATGATTGGATAGAAATACTGCTAAAGAGGTGTGAAGAGCTGCAATTAGTAGGATCAATTGGTGCAAGAGTTCTAACCTTTGAGTTAGTTGAAAATGAGTGTTCACATTTGAAGCATCTCCACCTTTTTAAAAATCGAGAATTTCAACATTTAATCCACCAACAGAACAAGCCTTTACGAAAAACTTTATCCAATTTGGAGGACCTACAACTTCGCTGTTTGGAGAATTTGGAAAGTATAATTGATGGGCATGGGCATGTCACAGAACTTGCTTTCAACAAGTTGAGGAGTGTAGATGTGGAGTATTGTCATAAAATGGGAACTCTGTTTTACAACTGCATGGTGGATGACATTTTGAATCTTGAAGACATTTTTATTTATGGGTGTGAGATGTTGGAATATTTGATCACTGTGACGATGGAAAGCAAAGAGACAACCGCCACTATTGAGTTTCCGCACTTGAAATCTTTAGAGCTACGTTTCGTACAACGGGTTCGAGGTTTCTGCTCCAAAATGGATCAAATTGGCAATGAGAGTTCATTCTTCAGTGAAGAGTATGTATGGAGCAAGGATTCCAGTGAGCTTTCGAGTTTTGAAAATATGGAAAGCTTGTTCATTCAAGGATGTCCAAATCTTAAAAGAGCATATTCAATCAAAGTTCTTAAGCAACAAAAAGAGCTGGGAATAGATTTCAACCAATTGAAAGAGATTCTTGAGAAGGAAAAGTTGTCAGTACATATGATGGATTCAAATCAATTTCAGACTTCCCAGGCTGGGACTACACAATTGCAAGATGGTCTGGAGTTGTTTCCCAAGCTTAAAACTCTTAAACTATATGGTTATTTGGACTACAACTCAACTCATCATTTGCCAATGGAAATGTTTCGAATGGTACACAATCTTGAAGAGTTTGAAGCAAGAAGGATGTTTATTAAAGAAATATTCCCAAATGAGAGATTGATGAATGTTGAAGAACAAAAGATCAATACAAGATTTGAGCCTTCTAGATTGGGTCTATACGAATTGCCCAAGCTTAAGCATTTCTGGAAGGATGAGTTCAAGAGTAGTTCATCACTTCAAAAATTGTATGATCTAATCATATCAGGATGTGGAGTATTGGATATGTTAGTGCCGTCGTCAGTATCTTTTACAAACTTGTGGAGGCTTGAGGTGAATAAATGTCATAGACTGACCCATTTGCTAAATCCTTCGGTGGCTAAAACCTTGGTGCAACTTACATGGTTGTGTTTAAAAGAATGCAAAAGGATGACGACTGTAATTGCAAGAGAAGTTGTTGAAGATCAAGGAAATGATGAAATTGTATTCCACAAATTATACATTTTAGAACTTGAGGATTTGTCCAAATTGACCAGCTTTCATTCTGGAAACTGCAACATCAGATTTCCGTGCTTGGGAAGTGTAGATATTAGGAGTTGTCCTGAAATGAAAGCTTTTTCTCCTGGAATCACAAGCACGCCTAACTTACTAGTTGGAGATATTAAGACTGAAGGTTCCTTTAGCCGGTATGGAATATCAGGTTCAAATGGCCGGCATGTAAATACTACAGAAGTTGTTGAAGATATCAATGGTATTATCCGACAAGTTTGGGAGGACGATTATGACGATGGGATTCAATATCTGTTTACGGAAAAGAATTTGGAGGAAGACCAAATATCCGATCATCCTTCTTCTCAATGA
Protein sequence
MSTSRFLLVLHLQQINHSEASISNDAPRIPTSPATISPSSNARASAMSASRDAFIFLQKDSTIIAKGEGGEEEDEIVFSQLKILELFNLPNLSTFHFGKSSFKLPCLGKVVAKKCPEMKASSGGVVSTPEHYWYVKCGSDEGFWTSNVNATINRLWYDGHDTSLQSLFIEQGGEKLVWSGGIGFHVKGENRLELNLEKFDIAIGIESRMVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGARVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIIDGHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMESKETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNESSFFSEEYVWSKDSSELSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLKEILEKEKLSVHMMDSNQFQTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMFRMVHNLEEFEARRMFIKEIFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKSSSSLQKLYDLIISGCGVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWLCLKECKRMTTVIAREVVEDQGNDEIVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVDIRSCPEMKAFSPGITSTPNLLVGDIKTEGSFSRYGISGSNGRHVNTTEVVEDINGIIRQVWEDDYDDGIQYLFTEKNLEEDQISDHPSSQ
Homology
BLAST of Moc08g42390 vs. NCBI nr
Match:
XP_022150758.1 (uncharacterized protein LOC111018819 [Momordica charantia])
HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 587/690 (85.07%), Postives = 587/690 (85.07%), Query Frame = 0
Query: 209 MVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGARVLTFELVENECSHLK 268
MVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGARVLTFELVENECSHLK
Sbjct: 1 MVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGARVLTFELVENECSHLK 60
Query: 269 HLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIIDGHGHVTELAFNKLRSV 328
HLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIIDGHGHVTELAFNKLRSV
Sbjct: 61 HLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIIDGHGHVTELAFNKLRSV 120
Query: 329 DVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMESKETTATIEFPHLKSLE 388
DVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMESKETTATIEFPHLKSLE
Sbjct: 121 DVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMESKETTATIEFPHLKSLE 180
Query: 389 LRFVQRVRGFCSKMDQIGNESSFFSE---------------------------------- 448
LRFVQRVRGFCSKMDQIGNESSFFSE
Sbjct: 181 LRFVQRVRGFCSKMDQIGNESSFFSEEVLLPNLEDLIITMADNLKMIWQNVLVPISFSKL 240
Query: 449 ------------------------------------------------------------ 508
Sbjct: 241 KRVEIDSCNNVEKAFPPNITTILTCLQSLTVMNCNLLKCIFELQEPNTTEKSIVLSNLRY 300
Query: 509 ---------EYVWSKDSSELSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLK 568
EYVWSKDSSELSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLK
Sbjct: 301 LKLYNLPSLEYVWSKDSSELSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLK 360
Query: 569 EILEKEKLSVHMMDSNQFQTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMF 628
EILEKEKLSVHMMDSNQFQTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMF
Sbjct: 361 EILEKEKLSVHMMDSNQFQTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMF 420
Query: 629 RMVHNLEEFEARRMFIKEIFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKS 688
RMVHNLEEFEARRMFIKEIFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKS
Sbjct: 421 RMVHNLEEFEARRMFIKEIFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKS 480
Query: 689 SSSLQKLYDLIISGCGVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWL 748
SSSLQKLYDLIISGCGVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWL
Sbjct: 481 SSSLQKLYDLIISGCGVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWL 540
Query: 749 CLKECKRMTTVIAREVVEDQGNDEIVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVD 796
CLKECKRMTTVIAREVVEDQGNDEIVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVD
Sbjct: 541 CLKECKRMTTVIAREVVEDQGNDEIVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVD 600
BLAST of Moc08g42390 vs. NCBI nr
Match:
XP_022150721.1 (probable disease resistance protein At4g27220 [Momordica charantia])
HSP 1 Score: 537.3 bits (1383), Expect = 2.2e-148
Identity = 300/444 (67.57%), Postives = 314/444 (70.72%), Query Frame = 0
Query: 193 ELNLEKFDIAIGIESRMVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGA 252
ELNLEKF+IAIGIESRMVMHK G SRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGA
Sbjct: 325 ELNLEKFNIAIGIESRMVMHK-GFSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGA 384
Query: 253 RVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIID 312
RVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIID
Sbjct: 385 RVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIID 444
Query: 313 GHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMES 372
GHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMES
Sbjct: 445 GHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMES 504
Query: 373 KETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNESSFFSE------------------ 432
KETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNESSFFSE
Sbjct: 505 KETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNESSFFSEEVLIPNLRDLTITRADNL 564
Query: 433 ------------------------------------------------------------ 492
Sbjct: 565 KMIWHNVLVPNSFSKLETVQIESCNNIEKVFPPNIMSVLSCLKSLTIRDCKLLKCIFEVQ 624
Query: 493 --------------------------EYVWSKDSSELSSFENMESLFIQGCPNLKRAYSI 533
EYVWSKDS EL FEN++ L I+ C L+RAY I
Sbjct: 625 EANTREKSIDLLSNLRDLILYNLPSLEYVWSKDSCELLIFENIKVLSIEECHKLQRAYPI 684
BLAST of Moc08g42390 vs. NCBI nr
Match:
XP_038890456.1 (probable disease resistance protein At4g27220 isoform X1 [Benincasa hispida])
HSP 1 Score: 453.0 bits (1164), Expect = 5.4e-123
Identity = 291/715 (40.70%), Postives = 403/715 (56.36%), Query Frame = 0
Query: 194 LNLEKFDIAIGIESRMVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGAR 253
LNLE F I IG + + K+ +SRTL LK+E+ SC+D+ I++L KR EEL L GSIG+R
Sbjct: 705 LNLETFKIFIGCKP-IGCWKMEVSRTLGLKIETESCVDNEIKMLSKRSEELHLAGSIGSR 764
Query: 254 VLTFELVENECSHLKHLHLFKNREFQHLI-HQQNK-PLRKTLSNLEDLQLRCLENLESII 313
VL FEL NE S+L+HL+++ N EFQH +++NK L+K LSNLE L+L+ LENLE++
Sbjct: 765 VLPFELNGNESSYLRHLYIYDNSEFQHFFNYERNKLSLQKVLSNLEVLELKNLENLETMF 824
Query: 314 DGHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTME 373
G +V E F KL+ + + C+K+ LF + ++ L LE++ I CEM++ ++ + E
Sbjct: 825 HGVHNVRESHFYKLKKIKLLRCNKLEILFVDFSLNKFLRLEEMKISDCEMMKAIVVI--E 884
Query: 374 SKETTATIEFPHLKSLELRFVQRVRGFCSKMDQIG----------------NESSFFSE- 433
S++ T IEF +LKSL L + R++ F SK+++ G N SFF++
Sbjct: 885 SEKATNKIEFMNLKSLNLEGLPRLQSFFSKIEKHGQLCVDNFERDETSRCSNHDSFFNQW 944
Query: 434 ------------------------------------------------------------ 493
Sbjct: 945 VSLPNLEQLKIKEAQNLKMIFHNILIPNSFSKLESLMIGECNNLEKVFPSNIISTFTCLK 1004
Query: 494 -------------------------------------------EYVWSKDSSELSSFENM 553
+Y+W KD EL +N+
Sbjct: 1005 ILRIKSCNLLEGVFEVQEPNAIQKNNDLLPSLRHLELIELPNLQYIWEKDPCELLKAKNL 1064
Query: 554 ESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLKEILEKEKLSVHMMDSNQFQTSQAGTT 613
E LFI CP LKR Y I VL+Q K L ID ++L EIL+KEK S +++ +Q +TS+A
Sbjct: 1065 EILFISQCPKLKREYPINVLRQLKNLEIDLSELNEILKKEK-STQILEFDQLETSKAEII 1124
Query: 614 QLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMFRMVHNLEEFEARRMFIKEIFPNERLM 673
QL+DGL LF KL+ LKL+G LD T LP+E+ +++HNLE FE R+ I+E+F +ERL
Sbjct: 1125 QLRDGLHLFFKLENLKLHGSLDDRYT-QLPIEIVQILHNLEVFEVRKALIEEVFSSERLD 1184
Query: 674 NVEEQKINTRFEPSRLGLYELPKLKHFWKDEF-KSSSSLQKLYDLIISGCGVLDMLVPSS 733
E N + S L LYELPKL+H ++ KSSS LQ L L + GCG+L+M++PSS
Sbjct: 1185 YSLEDWQNKKINLSSLSLYELPKLRHLCNEDLQKSSSILQNLRYLKVFGCGILNMILPSS 1244
Query: 734 VSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWLCLKECKRMTTVIAREVVEDQGNDEI 786
+ FTNL +L V CH+LT+LLNPS+ + LV L L ++ CKRMTTVIA +E + NDEI
Sbjct: 1245 MPFTNLAQLRVENCHQLTYLLNPSIGRRLVNLVVLAIEGCKRMTTVIAGG-IELEENDEI 1304
BLAST of Moc08g42390 vs. NCBI nr
Match:
XP_016901814.1 (PREDICTED: probable disease resistance protein At1g63360 isoform X1 [Cucumis melo])
HSP 1 Score: 439.9 bits (1130), Expect = 4.7e-119
Identity = 300/739 (40.60%), Postives = 396/739 (53.59%), Query Frame = 0
Query: 193 ELNLEKFDIAIGIESRMVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGA 252
ELNLEKF I IG + + + +KMESGSCLDDWI+ILLKR EE+ L GSI +
Sbjct: 711 ELNLEKFVINIGCQRDGRYIYENNTSFIGIKMESGSCLDDWIKILLKRSEEVHLKGSICS 770
Query: 253 RVLTFELVE-NECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESII 312
++L ELV+ N+ HLK+L+L+ + +FQH IH++NKPLRK LS LE L L L NLES+I
Sbjct: 771 KILHSELVDANDFVHLKYLYLYDDSKFQHFIHEKNKPLRKCLSKLEYLNLNNLGNLESVI 830
Query: 313 DGHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTME 372
HG+ E N L++V + C+K+ TLF+N +DDILNLE + + CE +E +ITV E
Sbjct: 831 --HGYHGESPLNNLKNVIISNCNKLKTLFFNYNLDDILNLEQLEVNVCEKMEVMITV-KE 890
Query: 373 SKETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNES--------------------SF 432
++E T IEF HLKSL LR++ R++ FCSK+++ G S SF
Sbjct: 891 NEEATNHIEFTHLKSLSLRYLSRLQKFCSKIEKFGQLSEDNSTNPRISTDSNTTNIGESF 950
Query: 433 FSE--------------------------------------------------------- 492
FSE
Sbjct: 951 FSEEVSLPNLEKLKIRSATNLKMIWSNNVLVPNSFSKLKEINIYSCNNLQKVLFSSNMMN 1010
Query: 493 --------------------------------------------------EYVWSKDSSE 552
EYVWSK+ SE
Sbjct: 1011 ILTCLKILIIEDCKLLEGIFEVQEPINIVEASPIVLQNLNELKLYNLPNLEYVWSKNPSE 1070
Query: 553 LSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLKEILEKEKLSVH-MMDSNQF 612
L S EN++SL I CP L+R YS+K+LKQ + L ID Q E++ K+K + + ++S Q
Sbjct: 1071 LLSLENIKSLTIDECPRLRREYSVKILKQLEALSIDIKQFVEVIWKKKSADYDRLESKQL 1130
Query: 613 QTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMFRMVHNLEEFEARRMFIKE 672
+TS ++++ D +L P LK LKLYG+++YNST HLPMEM +++ LE+FE FI+E
Sbjct: 1131 ETS---SSKVGDSSKLLPNLKKLKLYGFVEYNST-HLPMEMLEILYQLEDFELEGAFIEE 1190
Query: 673 IFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKS---SSSLQKLYDLIISGC 732
IFP+ L I + R L +LPKLKH W +EF +S LQ L L IS C
Sbjct: 1191 IFPSNIL-------IPSYMVLRRFALSKLPKLKHLWDEEFSQNNITSVLQDLLILSISEC 1250
Query: 733 GVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWLCLKECKRMTTVIARE 792
G L LVPS V FTNL +V KC LTHLLNP VA LV L L ++ECKRM++VI R
Sbjct: 1251 GRLSSLVPSLVCFTNLVVFDVIKCDGLTHLLNPLVATKLVHLEHLRIEECKRMSSVIERG 1310
Query: 793 VVEDQGNDE-IVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVDIRSCPEMKAFSPGI 795
E+ GNDE IVF+ L +L + S LTSF+ G C I+FPCL V I+ CPEMK FS GI
Sbjct: 1311 SAEEDGNDEIIVFNSLQLLIITSCSNLTSFYRGGCIIKFPCLEEVYIQKCPEMKVFSFGI 1370
BLAST of Moc08g42390 vs. NCBI nr
Match:
XP_008441731.1 (PREDICTED: probable disease resistance protein At4g27220 [Cucumis melo] >XP_008441732.1 PREDICTED: probable disease resistance protein At4g27220 [Cucumis melo] >XP_008441734.1 PREDICTED: probable disease resistance protein At4g27220 [Cucumis melo] >XP_016899499.1 PREDICTED: probable disease resistance protein At4g27220 [Cucumis melo])
HSP 1 Score: 424.1 bits (1089), Expect = 2.7e-114
Identity = 282/721 (39.11%), Postives = 384/721 (53.26%), Query Frame = 0
Query: 194 LNLEKFDIAIGIESRMVMHKLGLSRTLYLKM-ESGSCLDDWIEILLKRCEELQLVGSIGA 253
LNLEKFDI IG R + +SR L LKM E+G+ +D+ I +LLKR EEL LVGS+GA
Sbjct: 709 LNLEKFDITIGCAPRGFWSR-EISRVLCLKMAETGTDIDNGINMLLKRSEELHLVGSVGA 768
Query: 254 RVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIID 313
RVL FEL ENE HLK L+++ N +FQH +Q P + S LE L+L LENLESI
Sbjct: 769 RVLPFELKENETLHLKKLYIYDNSKFQHFNLEQKNPFQNVWSKLEYLKLSNLENLESIFH 828
Query: 314 GHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMES 373
HV NKL+ + + C+K+ +LFY ++DD+ +LE+I I GC M+ ++ +
Sbjct: 829 -CDHVRGSQLNKLKVIKLLGCNKLRSLFYYSILDDLFHLEEIKIIGCAMMRTIV----GN 888
Query: 374 KETTATIEFPHLKSLELRFVQRVRGFCSKMDQ--------------IGNESSFFSE---- 433
++ T IE LK L L + R+ F SK+++ N SFF+E
Sbjct: 889 EKATEKIELASLKYLTLMDLPRLHSFFSKIEKHEQSCLDNLQPDKTSRNNDSFFNELVSL 948
Query: 434 ------------------------------------------------------------ 493
Sbjct: 949 PNLVRLRIGEAHNLKMIFHNILIPNSFSKLESLWIVECNNLEKVFPSNIMSRLTCLKLLI 1008
Query: 494 ----------------------------------------EYVWSKDSSELSSFENMESL 553
+Y+W + ELS +N+E L
Sbjct: 1009 IMNCNLLEGVFEMQEPKGTKKSIDLLPSLRHLELIELPNLQYIWEDNFYELSKVKNIEKL 1068
Query: 554 FIQGCPNLKRAYSIKVLKQQKELGIDFNQLKEILEKEKLSVHMMDSNQFQTSQAGTTQLQ 613
I+ CP LK Y +KVL+Q + L ID LKEI KEK + M++ + +TS+ +
Sbjct: 1069 DIRQCPKLKIEYPMKVLRQLEMLTIDLRDLKEIPLKEK-TTQMLELEEMETSKDEIIPFR 1128
Query: 614 DGLELFPKLKTLKLYGYLDYNSTHHLPMEMFRMVHNLEEFEARRMFIKEIFPNERLMNVE 673
DG +LF +LK L+LYG DY T HLPM + +++HN+E FE R+ F +E+FP ER +
Sbjct: 1129 DGSKLFSRLKHLRLYGSFDYCQT-HLPMRIVQILHNIEVFEVRKTFFEEVFPIERSWDNV 1188
Query: 674 EQKINTRFEPSRLGLYELPKLKHFWKDEF-KSSSSLQKLYDLIISGCGVLDMLVPSSVSF 733
E+ N R++ SRL L+ELPKL++ W K+SS +Q L +L + GCG+L M VPSS+SF
Sbjct: 1189 EEWQNERYKLSRLKLFELPKLRYLWSGGLQKNSSIVQNLMELNVLGCGILSMSVPSSMSF 1248
Query: 734 TNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWLCLKECKRMTTVIAREVVEDQGNDEIVFH 793
NL L V KCH++T+LLNPSVA+TLVQL L L ECKRM TVI V E+ NDEI+F+
Sbjct: 1249 RNLTWLTVRKCHKMTYLLNPSVARTLVQLRLLVLGECKRMITVIVEGVEEE--NDEILFN 1308
Query: 794 KLYILELEDLSKLTSFHSGNCNIRFPCLGSVDIRSCPEMKAFSPGITSTPNLLVGDIKTE 795
+L ++L D+ KLTSFHSG C IRFPCL + I +CPEM+ FS GI STP LL +I
Sbjct: 1309 RLDSIDLRDMLKLTSFHSGKCTIRFPCLDELAIENCPEMRDFSLGIVSTPLLLTENIGLY 1368
BLAST of Moc08g42390 vs. ExPASy TrEMBL
Match:
A0A6J1DAA6 (uncharacterized protein LOC111018819 OS=Momordica charantia OX=3673 GN=LOC111018819 PE=4 SV=1)
HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 587/690 (85.07%), Postives = 587/690 (85.07%), Query Frame = 0
Query: 209 MVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGARVLTFELVENECSHLK 268
MVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGARVLTFELVENECSHLK
Sbjct: 1 MVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGARVLTFELVENECSHLK 60
Query: 269 HLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIIDGHGHVTELAFNKLRSV 328
HLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIIDGHGHVTELAFNKLRSV
Sbjct: 61 HLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIIDGHGHVTELAFNKLRSV 120
Query: 329 DVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMESKETTATIEFPHLKSLE 388
DVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMESKETTATIEFPHLKSLE
Sbjct: 121 DVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMESKETTATIEFPHLKSLE 180
Query: 389 LRFVQRVRGFCSKMDQIGNESSFFSE---------------------------------- 448
LRFVQRVRGFCSKMDQIGNESSFFSE
Sbjct: 181 LRFVQRVRGFCSKMDQIGNESSFFSEEVLLPNLEDLIITMADNLKMIWQNVLVPISFSKL 240
Query: 449 ------------------------------------------------------------ 508
Sbjct: 241 KRVEIDSCNNVEKAFPPNITTILTCLQSLTVMNCNLLKCIFELQEPNTTEKSIVLSNLRY 300
Query: 509 ---------EYVWSKDSSELSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLK 568
EYVWSKDSSELSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLK
Sbjct: 301 LKLYNLPSLEYVWSKDSSELSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLK 360
Query: 569 EILEKEKLSVHMMDSNQFQTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMF 628
EILEKEKLSVHMMDSNQFQTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMF
Sbjct: 361 EILEKEKLSVHMMDSNQFQTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMF 420
Query: 629 RMVHNLEEFEARRMFIKEIFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKS 688
RMVHNLEEFEARRMFIKEIFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKS
Sbjct: 421 RMVHNLEEFEARRMFIKEIFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKS 480
Query: 689 SSSLQKLYDLIISGCGVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWL 748
SSSLQKLYDLIISGCGVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWL
Sbjct: 481 SSSLQKLYDLIISGCGVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWL 540
Query: 749 CLKECKRMTTVIAREVVEDQGNDEIVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVD 796
CLKECKRMTTVIAREVVEDQGNDEIVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVD
Sbjct: 541 CLKECKRMTTVIAREVVEDQGNDEIVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVD 600
BLAST of Moc08g42390 vs. ExPASy TrEMBL
Match:
A0A6J1DCC9 (probable disease resistance protein At4g27220 OS=Momordica charantia OX=3673 GN=LOC111018787 PE=4 SV=1)
HSP 1 Score: 537.3 bits (1383), Expect = 1.1e-148
Identity = 300/444 (67.57%), Postives = 314/444 (70.72%), Query Frame = 0
Query: 193 ELNLEKFDIAIGIESRMVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGA 252
ELNLEKF+IAIGIESRMVMHK G SRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGA
Sbjct: 325 ELNLEKFNIAIGIESRMVMHK-GFSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGA 384
Query: 253 RVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIID 312
RVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIID
Sbjct: 385 RVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIID 444
Query: 313 GHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMES 372
GHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMES
Sbjct: 445 GHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMES 504
Query: 373 KETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNESSFFSE------------------ 432
KETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNESSFFSE
Sbjct: 505 KETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNESSFFSEEVLIPNLRDLTITRADNL 564
Query: 433 ------------------------------------------------------------ 492
Sbjct: 565 KMIWHNVLVPNSFSKLETVQIESCNNIEKVFPPNIMSVLSCLKSLTIRDCKLLKCIFEVQ 624
Query: 493 --------------------------EYVWSKDSSELSSFENMESLFIQGCPNLKRAYSI 533
EYVWSKDS EL FEN++ L I+ C L+RAY I
Sbjct: 625 EANTREKSIDLLSNLRDLILYNLPSLEYVWSKDSCELLIFENIKVLSIEECHKLQRAYPI 684
BLAST of Moc08g42390 vs. ExPASy TrEMBL
Match:
A0A1S4E0R8 (probable disease resistance protein At1g63360 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495494 PE=4 SV=1)
HSP 1 Score: 439.9 bits (1130), Expect = 2.3e-119
Identity = 300/739 (40.60%), Postives = 396/739 (53.59%), Query Frame = 0
Query: 193 ELNLEKFDIAIGIESRMVMHKLGLSRTLYLKMESGSCLDDWIEILLKRCEELQLVGSIGA 252
ELNLEKF I IG + + + +KMESGSCLDDWI+ILLKR EE+ L GSI +
Sbjct: 711 ELNLEKFVINIGCQRDGRYIYENNTSFIGIKMESGSCLDDWIKILLKRSEEVHLKGSICS 770
Query: 253 RVLTFELVE-NECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESII 312
++L ELV+ N+ HLK+L+L+ + +FQH IH++NKPLRK LS LE L L L NLES+I
Sbjct: 771 KILHSELVDANDFVHLKYLYLYDDSKFQHFIHEKNKPLRKCLSKLEYLNLNNLGNLESVI 830
Query: 313 DGHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTME 372
HG+ E N L++V + C+K+ TLF+N +DDILNLE + + CE +E +ITV E
Sbjct: 831 --HGYHGESPLNNLKNVIISNCNKLKTLFFNYNLDDILNLEQLEVNVCEKMEVMITV-KE 890
Query: 373 SKETTATIEFPHLKSLELRFVQRVRGFCSKMDQIGNES--------------------SF 432
++E T IEF HLKSL LR++ R++ FCSK+++ G S SF
Sbjct: 891 NEEATNHIEFTHLKSLSLRYLSRLQKFCSKIEKFGQLSEDNSTNPRISTDSNTTNIGESF 950
Query: 433 FSE--------------------------------------------------------- 492
FSE
Sbjct: 951 FSEEVSLPNLEKLKIRSATNLKMIWSNNVLVPNSFSKLKEINIYSCNNLQKVLFSSNMMN 1010
Query: 493 --------------------------------------------------EYVWSKDSSE 552
EYVWSK+ SE
Sbjct: 1011 ILTCLKILIIEDCKLLEGIFEVQEPINIVEASPIVLQNLNELKLYNLPNLEYVWSKNPSE 1070
Query: 553 LSSFENMESLFIQGCPNLKRAYSIKVLKQQKELGIDFNQLKEILEKEKLSVH-MMDSNQF 612
L S EN++SL I CP L+R YS+K+LKQ + L ID Q E++ K+K + + ++S Q
Sbjct: 1071 LLSLENIKSLTIDECPRLRREYSVKILKQLEALSIDIKQFVEVIWKKKSADYDRLESKQL 1130
Query: 613 QTSQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMFRMVHNLEEFEARRMFIKE 672
+TS ++++ D +L P LK LKLYG+++YNST HLPMEM +++ LE+FE FI+E
Sbjct: 1131 ETS---SSKVGDSSKLLPNLKKLKLYGFVEYNST-HLPMEMLEILYQLEDFELEGAFIEE 1190
Query: 673 IFPNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKS---SSSLQKLYDLIISGC 732
IFP+ L I + R L +LPKLKH W +EF +S LQ L L IS C
Sbjct: 1191 IFPSNIL-------IPSYMVLRRFALSKLPKLKHLWDEEFSQNNITSVLQDLLILSISEC 1250
Query: 733 GVLDMLVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWLCLKECKRMTTVIARE 792
G L LVPS V FTNL +V KC LTHLLNP VA LV L L ++ECKRM++VI R
Sbjct: 1251 GRLSSLVPSLVCFTNLVVFDVIKCDGLTHLLNPLVATKLVHLEHLRIEECKRMSSVIERG 1310
Query: 793 VVEDQGNDE-IVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVDIRSCPEMKAFSPGI 795
E+ GNDE IVF+ L +L + S LTSF+ G C I+FPCL V I+ CPEMK FS GI
Sbjct: 1311 SAEEDGNDEIIVFNSLQLLIITSCSNLTSFYRGGCIIKFPCLEEVYIQKCPEMKVFSFGI 1370
BLAST of Moc08g42390 vs. ExPASy TrEMBL
Match:
A0A1S3B439 (probable disease resistance protein At4g27220 OS=Cucumis melo OX=3656 GN=LOC103485808 PE=4 SV=1)
HSP 1 Score: 424.1 bits (1089), Expect = 1.3e-114
Identity = 282/721 (39.11%), Postives = 384/721 (53.26%), Query Frame = 0
Query: 194 LNLEKFDIAIGIESRMVMHKLGLSRTLYLKM-ESGSCLDDWIEILLKRCEELQLVGSIGA 253
LNLEKFDI IG R + +SR L LKM E+G+ +D+ I +LLKR EEL LVGS+GA
Sbjct: 709 LNLEKFDITIGCAPRGFWSR-EISRVLCLKMAETGTDIDNGINMLLKRSEELHLVGSVGA 768
Query: 254 RVLTFELVENECSHLKHLHLFKNREFQHLIHQQNKPLRKTLSNLEDLQLRCLENLESIID 313
RVL FEL ENE HLK L+++ N +FQH +Q P + S LE L+L LENLESI
Sbjct: 769 RVLPFELKENETLHLKKLYIYDNSKFQHFNLEQKNPFQNVWSKLEYLKLSNLENLESIFH 828
Query: 314 GHGHVTELAFNKLRSVDVEYCHKMGTLFYNCMVDDILNLEDIFIYGCEMLEYLITVTMES 373
HV NKL+ + + C+K+ +LFY ++DD+ +LE+I I GC M+ ++ +
Sbjct: 829 -CDHVRGSQLNKLKVIKLLGCNKLRSLFYYSILDDLFHLEEIKIIGCAMMRTIV----GN 888
Query: 374 KETTATIEFPHLKSLELRFVQRVRGFCSKMDQ--------------IGNESSFFSE---- 433
++ T IE LK L L + R+ F SK+++ N SFF+E
Sbjct: 889 EKATEKIELASLKYLTLMDLPRLHSFFSKIEKHEQSCLDNLQPDKTSRNNDSFFNELVSL 948
Query: 434 ------------------------------------------------------------ 493
Sbjct: 949 PNLVRLRIGEAHNLKMIFHNILIPNSFSKLESLWIVECNNLEKVFPSNIMSRLTCLKLLI 1008
Query: 494 ----------------------------------------EYVWSKDSSELSSFENMESL 553
+Y+W + ELS +N+E L
Sbjct: 1009 IMNCNLLEGVFEMQEPKGTKKSIDLLPSLRHLELIELPNLQYIWEDNFYELSKVKNIEKL 1068
Query: 554 FIQGCPNLKRAYSIKVLKQQKELGIDFNQLKEILEKEKLSVHMMDSNQFQTSQAGTTQLQ 613
I+ CP LK Y +KVL+Q + L ID LKEI KEK + M++ + +TS+ +
Sbjct: 1069 DIRQCPKLKIEYPMKVLRQLEMLTIDLRDLKEIPLKEK-TTQMLELEEMETSKDEIIPFR 1128
Query: 614 DGLELFPKLKTLKLYGYLDYNSTHHLPMEMFRMVHNLEEFEARRMFIKEIFPNERLMNVE 673
DG +LF +LK L+LYG DY T HLPM + +++HN+E FE R+ F +E+FP ER +
Sbjct: 1129 DGSKLFSRLKHLRLYGSFDYCQT-HLPMRIVQILHNIEVFEVRKTFFEEVFPIERSWDNV 1188
Query: 674 EQKINTRFEPSRLGLYELPKLKHFWKDEF-KSSSSLQKLYDLIISGCGVLDMLVPSSVSF 733
E+ N R++ SRL L+ELPKL++ W K+SS +Q L +L + GCG+L M VPSS+SF
Sbjct: 1189 EEWQNERYKLSRLKLFELPKLRYLWSGGLQKNSSIVQNLMELNVLGCGILSMSVPSSMSF 1248
Query: 734 TNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWLCLKECKRMTTVIAREVVEDQGNDEIVFH 793
NL L V KCH++T+LLNPSVA+TLVQL L L ECKRM TVI V E+ NDEI+F+
Sbjct: 1249 RNLTWLTVRKCHKMTYLLNPSVARTLVQLRLLVLGECKRMITVIVEGVEEE--NDEILFN 1308
Query: 794 KLYILELEDLSKLTSFHSGNCNIRFPCLGSVDIRSCPEMKAFSPGITSTPNLLVGDIKTE 795
+L ++L D+ KLTSFHSG C IRFPCL + I +CPEM+ FS GI STP LL +I
Sbjct: 1309 RLDSIDLRDMLKLTSFHSGKCTIRFPCLDELAIENCPEMRDFSLGIVSTPLLLTENIGLY 1368
BLAST of Moc08g42390 vs. ExPASy TrEMBL
Match:
A0A6J1D9T1 (probable disease resistance protein At1g61300 OS=Momordica charantia OX=3673 GN=LOC111018197 PE=4 SV=1)
HSP 1 Score: 386.0 bits (990), Expect = 3.9e-103
Identity = 217/322 (67.39%), Postives = 250/322 (77.64%), Query Frame = 0
Query: 486 SQAGTTQLQDGLELFPKLKTLKLYGYLDYNSTHHLPMEMFRMVHNLEEFEARRMFIKEIF 545
SQ TTQLQDGL+LF KLK+LKL G L YNS+ HLP+E+ R+VHNLE FE RRM +KEIF
Sbjct: 4 SQVETTQLQDGLKLFSKLKSLKLSGSLIYNSS-HLPIEIVRIVHNLERFELRRMLVKEIF 63
Query: 546 PNERLMNVEEQKINTRFEPSRLGLYELPKLKHFWKDEFKSSSSLQKLYDLIISGCGVLDM 605
PNE+L+NVEE + N RFEPS L L+ELPKLKHFWKD++KS+SSL+ L LIISGCG+LDM
Sbjct: 64 PNEKLINVEEYR-NIRFEPSDLSLFELPKLKHFWKDDYKSTSSLKNLASLIISGCGILDM 123
Query: 606 LVPSSVSFTNLWRLEVNKCHRLTHLLNPSVAKTLVQLTWLCLKECKRMTTVIAREVVEDQ 665
LVPSSVSF NL +LEV+KCHRLTHLLNPSVA+TLVQL L LK+CKRMTTVIA EVVE +
Sbjct: 124 LVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIA-EVVE-E 183
Query: 666 GNDEIVFHKLYILELEDLSKLTSFHSGNCNIRFPCLGSVDIRSCPEMKAFSPGITSTPNL 725
GN+EIVF +L L LEDLSKLTSFHSG C IRFP L V I +CP+M+ FS GI ST NL
Sbjct: 184 GNEEIVFSRLKYLFLEDLSKLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNL 243
Query: 726 LVGDIKTEGSF--SRYGISGSNGRHVNTTEVVEDINGIIR-----------QVWEDDYDD 785
LV D++ SRYG SN + ++VVEDINGIIR Q WED+YD
Sbjct: 244 LVRDLRIHHGIKGSRYGYEDSNYGY-EDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT 303
Query: 786 GIQYLFTEKNLEEDQISDHPSS 795
GIQYLFTEKNLEE+Q SDH SS
Sbjct: 304 GIQYLFTEKNLEENQ-SDHSSS 319
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022150758.1 | 0.0e+00 | 85.07 | uncharacterized protein LOC111018819 [Momordica charantia] | [more] |
XP_022150721.1 | 2.2e-148 | 67.57 | probable disease resistance protein At4g27220 [Momordica charantia] | [more] |
XP_038890456.1 | 5.4e-123 | 40.70 | probable disease resistance protein At4g27220 isoform X1 [Benincasa hispida] | [more] |
XP_016901814.1 | 4.7e-119 | 40.60 | PREDICTED: probable disease resistance protein At1g63360 isoform X1 [Cucumis mel... | [more] |
XP_008441731.1 | 2.7e-114 | 39.11 | PREDICTED: probable disease resistance protein At4g27220 [Cucumis melo] >XP_0084... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DAA6 | 0.0e+00 | 85.07 | uncharacterized protein LOC111018819 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DCC9 | 1.1e-148 | 67.57 | probable disease resistance protein At4g27220 OS=Momordica charantia OX=3673 GN=... | [more] |
A0A1S4E0R8 | 2.3e-119 | 40.60 | probable disease resistance protein At1g63360 isoform X1 OS=Cucumis melo OX=3656... | [more] |
A0A1S3B439 | 1.3e-114 | 39.11 | probable disease resistance protein At4g27220 OS=Cucumis melo OX=3656 GN=LOC1034... | [more] |
A0A6J1D9T1 | 3.9e-103 | 67.39 | probable disease resistance protein At1g61300 OS=Momordica charantia OX=3673 GN=... | [more] |
Match Name | E-value | Identity | Description | |