Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GTCCGTACAAATTTCCGTTTTTTGGTTCACGATCGCTTCGAATGGCCGGCGCTGCTAAATCCCAATCCTAAACCCTTTTTTTCTGTTCAACTACTGTGAGCTGTTCCTTTCTTTCCATTTCGTTCTTTTTCTCTGAAGCCTAGATGAGTAGATCAATCGGAAGAAAGGTTCCAGGTTTTTCACTTCTATCAAATGCCAACAAACTAAGCTTTGTGCCATTTTCTTCTTCCTCTTCTTTCGGTGGTCATGGTCGTGGCCGAGGCCGAGGTGGCTTTCCCTCCCATGCCGGACCCTTTGATTTCACTTCTCCAGTCCCCGGTCAAGAAGATTCAAATACGTCTAAACAAGATTCCGTAGGTTCTCGTCCTACTCCTGGCCTTGGACATGGTAAACCCACTCCTTCCTCCCCAATTCTTCCATCTTTCTCTTCCTTTTCGCCCTCTGTTAACCCATCGTCTGCTGGTCGAGGTAGAGTCCCGCCACCGATTCGTTCCCCTCCTGGTCCAGGTTCGTCGAAGGGGTCAGATTCGGAGCCTAAGAAACCTGTGTTTTTCTCGAAGGATAATGCGGGCGACTCGGCTTCAACTACTCGACCTGGCGCGTTACACAGGGGTGTGGGAGAAAGAAACTTACCTGATACTTTGCTTTCTGGATTTCCCGGTGTTGGACGAGGAAAACCTATGAAGCAACCAGGGCAGGAAGCTCAACCAAAGCAGGAGAATCGTCATCTCAGACCTAGTGGAGTTGGTGAGCCTGGAAGGGGTCGTGGCGGCGGCCAAAGGATGAGCCGTGATGGACCTGGGAGGAACACTGGTGGGATGATGTCAAGGATCGGGCCTGGTGGTGAAGATGGTGGTGATCGAGGGAGAAGCGGGCTCCGGGGCAGAGGAACGTTTCGAGGCAGAGGAGGATTCAGAGGGAGAGGAAGGGGGGGCATTAGAACTGGGGAGAGATGGGAGAGAGGAAGAGTTCAAGATATGGAGGATGGATATGCTGCAGGACTTTATTTAGGGGACAATGCAGACGGTGAGAAGCTGGCAAAGAGGATTGGGACTGAACATATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGTTAGAGTGCTGCCTTCACCACTGGAGGAAGAGTATTTGGGTGCGGTGCATACTAATTATATGGTGAGCACTTCCTGCGTCTTTTTTGTCCCTGTTTTTCCTCTCTAAGGACATAAAATGGACTGCGTATTAGGACTAGCTCTAAACTGTTTTGGCATGATTTTTTGAAACGGAAACAAATTTTCGTCGATACGAGGCTAGTGCTCCGAGTATAAAGAGACATAGGAATCATACAACTCGAATAAAACTATGGGTCAAGAGACGCACTAGGCTATATCAACTAGGTTGATACAATCTTAGCATCCTTGTCATTTCCCATTACAAAATACAACCTTAAGATAGAACATATAGTTTGTTGCGCAATATATCATATAGAATTAAGACATACTAACAAAAACAACAATGCTAAACAGTAAAAATGAAAGCATTGAGGCATATGTCTTGTATGAAATAATTTGCAAAGAACTTGGACTGTGAACTCCATGAAGAAGCTAGAACAAAAGTTCTGAAAGTATGGCTTCAACTGCATTTGACCATAAAACTTGCCTTTTGGATGAGAGTGAGGGGCCAGCCAACAGTTGAGATACATTATCTTGGAAAGAATTTCCAAAAGTCCAAGACAAATTAAAATATGAGAAGAGTTTTCACCAGCAATTAGTAGAAAATGGACACTCAAAAAATAGGAGCGACCTTATGCTGTAACATGGCAGCACAGTTAAGAGTGCCAAAAATCATAGTCCAAACCAGAATGTTGACTTTTTTGGGGCTCTTGGACTTCTATAATGCTTGAAAAATTATCTTATCTATAGGAGAGGAAGCAATTAAAAAAAGAAAAAAAGAAAAAACTGTTTTCGCATGATTGGAAAGTATACTGTTTTCGGTGTTTGAACAGTGTGCTGGTTGAATTATCACAAATCTGATTTTGTTTAAGTTTATCCTGAAAAATGTAGAGATTGGATTACTAATGCTCGTTTTGACTTTTTTCTGTATGGCAGATAGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAATCCTGATATTGATGAGAATCCACCTATTCCTCTTCGGGATGCACTTGAGAAGATGAAACCATTTTTAATGGCCTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGTATCATTTTCTGTTACCACTCCATGTTTTCATTCCTTTATTCTTGGGTCTTTTCTACTTGTTAATTTGTGCTCTGCATTTGAGTGAAGTGTATATGTGAGTTATTTTTGTGGTTTTTGAGCATTGATGAAAATATCCACTATACTCTACTTACGTTGGTATTTCCTCGATCTTGCTATTATTAGATATTATATTTATCTGATACAGGTAAAGATGCAGGAAGTGCGCAAAGTATATAGACGTCTTACATTAATTATTTAATTGCTCGTCTCAGTTGCTATTTCTCTAGAGGAGTTTTTAGCTGACATTTTTGAGAATTGCATCAATAAGCTTGCTTAGGATATTTATCAAGAAAATTGAAAGTTGTTCAAAAGAAAGGACTAGACACAAGAGCCTGAAGTAGAGGAGCCCACAAGAACCAAACAAGGAATTACTGGTAGTGGCGGCAGTTTTTAGAATGAAAAAACAATGGAAAGGTACTGAAAAACTCTGTGAGAACACTATGAGGGATTATTGATTCTTCTATCAATGAAACCTACATATGTTATATCCCTGAAAAGGAGAATGCAAGGAGAGTGAAAAACTTCAGATCCATTAACCATGTTACCAACATCTTATGAGATTGATTATTGCCAAGGTGTTGACATATTGATCGAGAAAGGTTCTCCCTAAGAATGTCTCAAAGTTTCACAATGCTTTTATGATGAGAGAGGTTATGGATGAGTTCTGGTTGGTAATGATGCCATTTTTTTGGAATAACTAACATTGTAAAGTACAAGACAGGTGATCCCCCATTACTACCCTTTGTCGTTAATGAATGCACAAACAAAATAACAGGGAGATACTAATATATTTTCATGATATAAAAAACCAGTTAACAGGAAATGGTCAGACCGATATTATGGAAAGATCAAGTACATTGACAGCCTAAATGCCATTACTATCATCTACATCTTCATCACTTGTCATCTTCCTCCTCTTTGCTGCCCTCATCTCCATCCTCATCATCGTTCACCTCTGACATTTACTTCTCAGATTCATCTTCTTTCGCAGCATTGGCTCCACTGGCCTGTTTCTTGTTATATGCCTTCATATTTTCTTCATATTCAACCTTCCTCTTGTCAGCCTTAGCATTGTAAGGTGCTTTTTCAGCATCCGACATTGATTTCCATTTCTGTCCAGCAGCTGTACCAACAGCAGACACTGCTTTGACATTCTTTTTTTGAAACACAAACTGCATAGCATTTTTCATTATTAAACTGATGAACAAAAAATCAGACAAACTAATTGTTTACCTCATTTTTCCCGCTCAAAAAACATCATGGAATATAAAACCCCCACAAATGAAGCACAAATTCAAGAACACAAAGCACATGAAGCAAACCACAGTTTGAACCATGCACAATCAATTCTCTCAAGACTTCTAAAATCTCCTCAATTCATCCATGGTCATTTTGTTTTTTGGCACTGGCTGTAAAGCAGTTCTCACCTTCCCCCCACCTCCAACAACTTTTATGTTGATGTTATTATTGTCCTTTTTGCTAAGCTGTTATTCGGATTCTCCTCATTGAACTTCTTCCTGAACTCTTCCATAAAAACGAAGAAGGCACTGGCAGGCCTCTTGGGTTTGTTAGGATCTTACCCTGCCTTCTTACTGCCTTTCTTGCCCACGTTGGCTGCACCTTTTTTCACTGCAAGCTTGGTGTCTGCTTTCTTCGACTCCGTCTTCAATTTACCACCTTTCATTTCGAATCCGAAGGGAGGAGGTTAGGTTAGGTTAGGATTAGGGTTTGAAGAGGGAGGAAGAAGCAGAGGTGCGTGTGAGAGAGAAAGAAGGAAGGGGTGGTTGGTAATGATGCTATTGAGAACCATATAGCTTGTTAGTGAGAAAGAATTGTTTTCAAGATTGATTTTGAGTAAGACGTTCTCCCCCTCCTTTTGTATCATTCTTTTTGATAATGAAAGTTGTGTTTGTTATGAAAAAGAAGGAGAAAAAAAAGAGATTGGTTTTGAGAGAGACTGATAATGTGGATTTGGGTTTCTTGGATAAGGTGGTTTGAATGAAAATTTTTGGGTTCAAGGGGAGATCTTGGTTATGGAATTGCATTAGATTTGTTAACTATCCTATTTCTGGTAAATAGTAAGCCTAGGGGTGGAATTTTTGCCTCTACTGGGACTGGGATTAGGCAAGAGACATTATTTCCTGTCCTTTTCCTAGTGGTTGGTGATGTCCTTGGTAGGTTAGGATGAATCAGGGTTTGGTTCGGGTTTTCACATGGGCGAAGAGTGGATTTGCCACTTTTTTAAATAATTAGAAAAATATTAGGCACTTGTATTAACCAAAAAAGAGCAGGCAACTCAAGCATAGGAGAAGGAGAAAACCCTCTCTGTACGGCCTTCCAATTGTGTCGTCCAAACCACGAAGGACTACGTGCAACCTTCCAATTGTGTGAAATCACAAGGAGGGTAAAATTTCTTGTGGTTGGAACACCACCAAGATGCTGTATACATCATTCCAAGAAAAAGACATGAGTAGTATTGCTATGCTAAAGGATCCTCTAATTCCTTTCCTTCCAAAGGAGCCAAAAAAGTACTAATGTTGCAGCTCCATGTGACTTTGGCTCTAGCTCTATTTCTCAGCCTTACTCCAGGTCTATCCAAACACTCCAACAACCAATCCTCAATTCAATTTGAAAGGCACGTTGAGAAATCAAATTCCTTGAGCAGCCTCATCCACCCCCTCAAAGCAGAAGGTCAATGGGTGAAAAACTCCCACTATTTAACAATGATGGCAGATCGAATCGAGGGAGAAAACATCCACCTAGAGGATTTTCTTTGGACTCTATTAAAGTATTCGCCGTGTCGAGACTTCTATACCAAAGAGTCCAAAGAAAAAACTTAACTTTATTTGGGATCTTAAATTTCCATATCATCCATACCATGAATTCACCTTTTTGTTTTTTAGTTTGTTGAGGACAATTTTTTCTTGCATGAGTAGGAATTCATTAATTGTTAAGAAAAATTTGCCCCTTTTTAGAGTCGTATGCTATTCAAGATCAATTGGAGAAAGGCTTCATTGGCCAGTATTAATTGTGATTTCGTTAAATTGGATGGATGAAGTTTTAAGTTGGTGTCATTTTCACTAATTGCCTTGGTCTTTCTCTTAGTCATTATTCCAATTGTCTCTACTTGTAAATCCTATTACAAAGAAGTTTATAAACAGCTTTGCACATGAAAAGTGCATTCTTCTTTGAAGGTGTTAGGATGATACTCATCTGGTCTGTGCTTGAGTGGGAGCTCGTCGTACTTCTCCTCTCTCTTTTAGTCGATCTCAAAGATCTTTTTTAATTATTCTGTAGGTACCCCCCAGCTAAAGCCCCTTTCTCTAGTTAGATTGCCTTCTGTGGGCTTGGCTTTTTGTATGCCCTTATATTTTTTTTCCTTCTTTCTTTCTCAATAAAAGTTGTTATCCAAAGAAAAAAAAAAATCACCTTTTAGTAGTTTTAAGGTGGAAAAAATGATAAGAGATTTTTTATCGTAGGGTGTGGAAGAGAGCTGGACGTTCCAGTTGAGTTAGGTGTGGGAGGTTTCTTTTAGAACAATTGGGGAAGGTATGAGGAATTGGGCATGAAAAACTTGAGGTTTGTTGTTTGCAGTACACCTTTGTAGTCTAAGTAGACGTGACATTTCCCTTTTGAGTTTGATGTTGTGTGGTGTAGGAGGGTAACGAGCAAACTGCAAAGATGTTTGTCACCAATCTGGTGGGTCTTGATGGTTGGATGTGAAGGAATTCTAGGAACGTTTGGAAGGTTGTGGTTGTCACATTTGTTTCCTTTTCTTTTACTAAGTTCTTGCAATTTTTAATTGGGTATGACTTGATTGTCCTATAGGAGGGATCTTAGGTAGGGGATAGTTCATTATGCCTTTGATTCCTTCACTTGTATCATCTTTCTTCTCATTATTCCCTGTAAAGGGATTCCAATTAATGTATATTTGGCTTTAAACTCTATTGTCTGATAAAAATAACTTTTGATTTCACCTATTTACTCTCCTCCTTGGGTGGGTTAGTGTTAGGAATGCTAGGGATATTAGGGCATGTTTGTCAGTAGCAAGGAGTTGGTTGGTGGAAGTCAATTATAAATAGAGGGGTTATGGGGTATTTCTACCAGTTTAATTGTTGTCTCGGGGATAGTGGGGGTGGGGGGTTTTCTTGTAAGGACTTTTTCTCTATATCATACTACTCTGATTCTTTTATGCTCTCTTTGTTCACTCTGGTGTAAAAGGTAAGAATCTCCAAGAAGGGCATGTTCTTTCTTTGACCCGTATTACATAGAGGATTAACATCATTATCAATTTATCACTGTCACCATTGGCTACCATCTAGAGGTAGCGTTTTGTGCTGTTGGAGCACATTGGTGCCCTTTATGTAAGATTCCTTTGGAGGACTTGTCACATTTTGTTGGGTTGTTAGTTCACACATTGGGTTTGCGGCAATTGTTTGGGAGCCTTCGGGGATGTGTCTGGTTCTTTTTAACAGCCTATCAAGTCACGTCTTCTGATAATTGAGGAGGCGTTGTATTGCTAGCCATTTTGGAATAAGGGAGAGATCCTTGATACGATCTTTTAATCAAGTCTAATTTTGGTTAAATTTGGTGCTTATATCTAGTTACTTTACTTCCTTAAGATGTTGGTCTACATTCATTTCATTGTGATATTATCTAGTGATTAGGATTTTTGGTGTTGGAGATGTTATATGTATATATATATATATGCGCCAACTGCTTTTTGTATTTTCATTATTTTCAAATATTGCTTTCCTTTGATTTAGAGTTAGAGAAGATTCCATCTGAAGATCTATACACTTGAAAATGTTTCCTTCTTAGCTAGTGCTTTAATGGGGTTGGAGAGGGTACTAAATATAAAGTTTCAGCCTGGAAGTAACTCTGTTCAGGTTTCAGGAATCCAAGAATTAGTTGAGTGATGTTATGTATGATTAGAAAGGCCAGCCTACTGTCAACATTTAGTCATGTTTGGCCTTGAGCTCTTTGAAATTTCGAACCTTCACATCAGAATGAAAAATATGGAGTTCAAATTTTATCAGCATCCTCCTGGATTGTCATTCATCGATCTCTCTCTCTCTCTCTCTCTCTATCTCCCTTCCTTTTTTTGTTTTTAAAAAACATTTCAACTGGAGACGCTTTCTTAACTCAGCTTTGGTTGGGAGGATGACTTGAAAATATGATACGCTCCTTATGAAGGATGGCTCTTCTTCTTCTTCATCTAACTTTTTCATTTGGCTGTCCCAGGAAATCATGGAAGAAACCATGCAAAGAGTTCCACTGCTTAAGGAGATAGTTGACTATTACAGTGGACCAGATAGAGTAACTGCAAAGCAACAACAAGGGGAGCTGGAAAGAGTTGCAAAAACACTTCCACAAAGTGCGCCTAATTCCATAAAGCAATTCACCAATCGTGCAGTTCTTTCTTTACAGGTTAGTGAAGCCACCTTAATGACTTATTATTCTGAATTATTTTTTGAAAAGAAAATAGGGAAAAGAAAAAAAAAAAAAAACCAGGAAGAAATTCTAAGCTGGAAACATACGTTTCAAATCACTATGGATGTATACTATATTTGTATGTATTTTCTTTCTCGGGAATGCAAGAAGACTATGTTTAAGAATTATTCCAAATTTGTAATGATCAATATCCATGAATGCCATGCATTATTTTTCATCCAAAAAAATGTATGCCATGCTTTACATATTTTATGAGTAATAGTTGCAATGGAAACAGCATTTGTAGCACTATATACAATTAAATGTTACCATCAAATAATATAGTGCATATTAATTTAGTATTAATAGTATTGAATAAAGTTTTACAAAAAAATCATTATTTTTACATTGAAGCTATTGCAGTTTTTTTTTCTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGATGGGGAAGGGGGGTACAATTGTATAATTGCATTCTTACTTGGTAGGAATATTGCATACCTACTTACAAGTTCCATTAATAGGTCTTACAAGTACACTGACAACTTCTACCTAAGTTATTTAGAATACATGTAATAATAGGAATGAATTGTTGGGATGTGATGTAAAGGGAACCTGTAAATTTCATACACCAATGAAGTTGTTTCTTATAGAAAAAAAAAAAGGAAAAAGAGAAGCATTGTAAAGTGAATTAAATATTTGTGTAGAATGAAGAGATGGCGATGCTATTCTGTCCTCTTTGTTTTGGATGTTCAGGGGAACTAAGTACTGATTTGTGAAACCCTGAGATGTGTGGGAACGCGAATGCTTGTTAACTAAACTAACATTGCTTTTCACTTTTCTAACCAAAGGAAAACCCTGAGAAAAGAGCATTGGGTAGAAATAAGAATGACGAACGGGCATAAAACTATGAAATGATATGGGCTTAGATGCACAGATACAATGCAACCAGAAAAGGAAAAAGAAAAAACAGAAATTTCATTTGGTTTTGCTGCAATAGTCTTAAACTGATATAAATGAACTGCTCTGTAATGATTTTATCTGTTTATGGAGCCAATATCTCATTTTACACAACTATTGTGTATATTGTTACACTAGGCAGCCTTATTTAATGTTGTTATGATCTACCTACATCCTGAAAGTCAAAATGTGTAAGAACAAGGCATTTTTAGTTCAGTCTCTGTACTCCAGAGGTTTAATTGACATTTCCTGTCTATGTTGCTTGTTAGTGAGTAATACAATAGGCTTTGACATTCAATATGAAGATGTTGAAAGGGATAGAAATGTGGAAGCTAATTTACACAATTCTTCTATTTGCAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTCATGGACAAGCTTGTAAGGGAGTTCTCCCAGCGATACAAGTAGGTTACTAAAGTTCCATCAAGGACATCAGGAATTACAAATTATACTCAACTAAAATGAAGCTACTTGCTTAATTGGTCTACATCTTTTCTTTATCCTTGGGTTTTAATTTTAGCTCATGAGGTGTTTAGAAGTTTTGCACTTTACCTTTCTGGTGTGTCAGAAGGGTGGTCTTTTGTAGTCCAATATTGCTGCAATAGCATTGTTGTACTGCTTCATTTTTTTGGAGGGCAATAATTGCATTTGTTTTGATCAACATTGTGCACTGAGAATTAGTCCTTTTATGTTGTACAATCTCATCCCTTGGCTACTACTCTTGCATTCTCCCATTCCTATCATAAGTTATACTTCTAAGAACTCATTTTTGTTGAGTTTGGGCAGTGGAGCAAGTTAGTATGTATAATGAATGCATATGGGTGAGTTGGTATGACATGGTTTTTGAAAACTATGCCAAATAGAAATTGAGTGTTAAATGAAGGCAGATAAAAAAACATATGTTCCTTGGATTTTTCTCTTTTTATCTACATTTTTTAATAGGCCCATTTTACGTATTACATTTGATTATGTATATTGGGAAGTAAACTTGGCCGATATTAATTAA
mRNA sequence
GTCCGTACAAATTTCCGTTTTTTGGTTCACGATCGCTTCGAATGGCCGGCGCTGCTAAATCCCAATCCTAAACCCTTTTTTTCTGTTCAACTACTGTGAGCTGTTCCTTTCTTTCCATTTCGTTCTTTTTCTCTGAAGCCTAGATGAGTAGATCAATCGGAAGAAAGGTTCCAGGTTTTTCACTTCTATCAAATGCCAACAAACTAAGCTTTGTGCCATTTTCTTCTTCCTCTTCTTTCGGTGGTCATGGTCGTGGCCGAGGCCGAGGTGGCTTTCCCTCCCATGCCGGACCCTTTGATTTCACTTCTCCAGTCCCCGGTCAAGAAGATTCAAATACGTCTAAACAAGATTCCGTAGGTTCTCGTCCTACTCCTGGCCTTGGACATGGTAAACCCACTCCTTCCTCCCCAATTCTTCCATCTTTCTCTTCCTTTTCGCCCTCTGTTAACCCATCGTCTGCTGGTCGAGGTAGAGTCCCGCCACCGATTCGTTCCCCTCCTGGTCCAGGTTCGTCGAAGGGGTCAGATTCGGAGCCTAAGAAACCTGTGTTTTTCTCGAAGGATAATGCGGGCGACTCGGCTTCAACTACTCGACCTGGCGCGTTACACAGGGGTGTGGGAGAAAGAAACTTACCTGATACTTTGCTTTCTGGATTTCCCGGTGTTGGACGAGGAAAACCTATGAAGCAACCAGGGCAGGAAGCTCAACCAAAGCAGGAGAATCGTCATCTCAGACCTAGTGGAGTTGGTGAGCCTGGAAGGGGTCGTGGCGGCGGCCAAAGGATGAGCCGTGATGGACCTGGGAGGAACACTGGTGGGATGATGTCAAGGATCGGGCCTGGTGGTGAAGATGGTGGTGATCGAGGGAGAAGCGGGCTCCGGGGCAGAGGAACGTTTCGAGGCAGAGGAGGATTCAGAGGGAGAGGAAGGGGGGGCATTAGAACTGGGGAGAGATGGGAGAGAGGAAGAGTTCAAGATATGGAGGATGGATATGCTGCAGGACTTTATTTAGGGGACAATGCAGACGGTGAGAAGCTGGCAAAGAGGATTGGGACTGAACATATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGTTAGAGTGCTGCCTTCACCACTGGAGGAAGAGTATTTGGGTGCGGTGCATACTAATTATATGATAGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAATCCTGATATTGATGAGAATCCACCTATTCCTCTTCGGGATGCACTTGAGAAGATGAAACCATTTTTAATGGCCTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGAAATCATGGAAGAAACCATGCAAAGAGTTCCACTGCTTAAGGAGATAGTTGACTATTACAGTGGACCAGATAGAGTAACTGCAAAGCAACAACAAGGGGAGCTGGAAAGAGTTGCAAAAACACTTCCACAAAGTGCGCCTAATTCCATAAAGCAATTCACCAATCGTGCAGTTCTTTCTTTACAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTCATGGACAAGCTTGTAAGGGAGTTCTCCCAGCGATACAAGCCCATTTTACGTATTACATTTGATTATGTATATTGGGAAGTAAACTTGGCCGATATTAATTAA
Coding sequence (CDS)
ATGAGTAGATCAATCGGAAGAAAGGTTCCAGGTTTTTCACTTCTATCAAATGCCAACAAACTAAGCTTTGTGCCATTTTCTTCTTCCTCTTCTTTCGGTGGTCATGGTCGTGGCCGAGGCCGAGGTGGCTTTCCCTCCCATGCCGGACCCTTTGATTTCACTTCTCCAGTCCCCGGTCAAGAAGATTCAAATACGTCTAAACAAGATTCCGTAGGTTCTCGTCCTACTCCTGGCCTTGGACATGGTAAACCCACTCCTTCCTCCCCAATTCTTCCATCTTTCTCTTCCTTTTCGCCCTCTGTTAACCCATCGTCTGCTGGTCGAGGTAGAGTCCCGCCACCGATTCGTTCCCCTCCTGGTCCAGGTTCGTCGAAGGGGTCAGATTCGGAGCCTAAGAAACCTGTGTTTTTCTCGAAGGATAATGCGGGCGACTCGGCTTCAACTACTCGACCTGGCGCGTTACACAGGGGTGTGGGAGAAAGAAACTTACCTGATACTTTGCTTTCTGGATTTCCCGGTGTTGGACGAGGAAAACCTATGAAGCAACCAGGGCAGGAAGCTCAACCAAAGCAGGAGAATCGTCATCTCAGACCTAGTGGAGTTGGTGAGCCTGGAAGGGGTCGTGGCGGCGGCCAAAGGATGAGCCGTGATGGACCTGGGAGGAACACTGGTGGGATGATGTCAAGGATCGGGCCTGGTGGTGAAGATGGTGGTGATCGAGGGAGAAGCGGGCTCCGGGGCAGAGGAACGTTTCGAGGCAGAGGAGGATTCAGAGGGAGAGGAAGGGGGGGCATTAGAACTGGGGAGAGATGGGAGAGAGGAAGAGTTCAAGATATGGAGGATGGATATGCTGCAGGACTTTATTTAGGGGACAATGCAGACGGTGAGAAGCTGGCAAAGAGGATTGGGACTGAACATATGAACCAACTGGTTGAAGGGTTTGAAGAGATGAGTGTTAGAGTGCTGCCTTCACCACTGGAGGAAGAGTATTTGGGTGCGGTGCATACTAATTATATGATAGAGTGTGAGCCGGAGTACTTGATGGGAGATTTTGAAAGTAATCCTGATATTGATGAGAATCCACCTATTCCTCTTCGGGATGCACTTGAGAAGATGAAACCATTTTTAATGGCCTATGAAAACATCCAAAGCCACGAAGAGTGGGAGGAAATCATGGAAGAAACCATGCAAAGAGTTCCACTGCTTAAGGAGATAGTTGACTATTACAGTGGACCAGATAGAGTAACTGCAAAGCAACAACAAGGGGAGCTGGAAAGAGTTGCAAAAACACTTCCACAAAGTGCGCCTAATTCCATAAAGCAATTCACCAATCGTGCAGTTCTTTCTTTACAGAGCAACCCAGGGTGGGGATTTGACAAGAAATGCCAGTTCATGGACAAGCTTGTAAGGGAGTTCTCCCAGCGATACAAGCCCATTTTACGTATTACATTTGATTATGTATATTGGGAAGTAAACTTGGCCGATATTAATTAA
Protein sequence
MSRSIGRKVPGFSLLSNANKLSFVPFSSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVPGQEDSNTSKQDSVGSRPTPGLGHGKPTPSSPILPSFSSFSPSVNPSSAGRGRVPPPIRSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVGRGKPMKQPGQEAQPKQENRHLRPSGVGEPGRGRGGGQRMSRDGPGRNTGGMMSRIGPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVREFSQRYKPILRITFDYVYWEVNLADIN
Homology
BLAST of CcUC07G138480 vs. NCBI nr
Match:
XP_038883040.1 (uncharacterized protein LOC120074102 [Benincasa hispida])
HSP 1 Score: 738.8 bits (1906), Expect = 3.1e-209
Identity = 405/489 (82.82%), Postives = 419/489 (85.69%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPF--SSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVP 60
MSRSIGRKVPGFSLL NANKL FVPF SSSSSFGGHGRGRGRG PSH G DFTSPVP
Sbjct: 1 MSRSIGRKVPGFSLLPNANKLGFVPFSSSSSSSFGGHGRGRGRGDIPSHTGSSDFTSPVP 60
Query: 61 GQEDSNTSKQDSVGSRPTPGLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGRV--PPP 120
GQEDSN SKQDS+ SRPTPGLGH GKP+ SS LPSF SFSPSV PSSAGRGRV P
Sbjct: 61 GQEDSNASKQDSLHSRPTPGLGHGRGKPSSSSSNLPSFPSFSPSVKPSSAGRGRVDASPS 120
Query: 121 IRSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGV 180
IR PP P SE KKPVFFSKDNAGDSA++TR G H+GVGER LPDTLLSGF GV
Sbjct: 121 IRFPPEP------VSELKKPVFFSKDNAGDSAASTRLGTPHKGVGERILPDTLLSGFTGV 180
Query: 181 GRGKPMKQPGQEAQPKQENRHLRP------SGVGEPGRGRGGGQRMSRDGPGRNTGGMMS 240
GRGKPMKQ EAQPK ENRH+RP G GEPGR RGGGQ MSRD PGRNTG M+S
Sbjct: 181 GRGKPMKQQVPEAQPKLENRHVRPRQEGGGRGAGEPGRSRGGGQGMSRDEPGRNTGRMVS 240
Query: 241 RIGPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGLY 300
R GP GE GG RGRSG + RG RGRG +RGRGRG RTG+R RGRVQD EDGYAAGLY
Sbjct: 241 RGGPDGEYGGGRGRSGFQSRG--RGRGTYRGRGRGEFRTGDRGGRGRVQDTEDGYAAGLY 300
Query: 301 LGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYLM 360
LGDNADGEKLAKRIG EHMNQLVEGFEEMS RVLPSPLEEEYL A+HTNYMIECEPEYLM
Sbjct: 301 LGDNADGEKLAKRIGPEHMNQLVEGFEEMSGRVLPSPLEEEYLDAMHTNYMIECEPEYLM 360
Query: 361 GDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDY 420
GDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDY
Sbjct: 361 GDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDY 420
Query: 421 YSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKL 478
YSGPDRVTAKQQQGELERVAKTLPQ+APNS+KQFTNRAVLSLQSNPGWGFDKKCQFMDKL
Sbjct: 421 YSGPDRVTAKQQQGELERVAKTLPQTAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKL 480
BLAST of CcUC07G138480 vs. NCBI nr
Match:
XP_022136793.1 (uncharacterized protein LOC111008406 [Momordica charantia])
HSP 1 Score: 723.8 bits (1867), Expect = 1.0e-204
Identity = 404/494 (81.78%), Postives = 417/494 (84.41%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPF---SSSSSFGGHGRGRGRGGFPSH--AGPFDFTS 60
MSRSIGRKVPG S L NA KLSFVPF SSSSS GGHGRGRGRGG PSH GPFDFTS
Sbjct: 1 MSRSIGRKVPGLSFLPNATKLSFVPFSSSSSSSSSGGHGRGRGRGGSPSHGGGGPFDFTS 60
Query: 61 PVPGQEDSNTSKQDSVGSRPTPGLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGRV-- 120
VPGQEDSN SKQ+SV S T GLGH GKP PSSPILPSFSSF+PSV SSAGRGRV
Sbjct: 61 RVPGQEDSNASKQESVDSPGTSGLGHGRGKPGPSSPILPSFSSFTPSVKSSSAGRGRVAG 120
Query: 121 PPPIRSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGF 180
PPIRS P GSSK SD EPKKPVFFSKDNA +SA++TR GAL RGVGERNLPD+LLS
Sbjct: 121 SPPIRSTPESGSSKQSDLEPKKPVFFSKDNAANSAASTRLGALDRGVGERNLPDSLLSVL 180
Query: 181 PGVGRGKPMKQPGQEAQPKQENRHLRP------SGVGEPGRGRGGGQRMSRDGPGRNTGG 240
GVGRGKPMKQP E QPKQENRHLRP G+G P RG GGG RMSRD RNTG
Sbjct: 181 SGVGRGKPMKQPVPEDQPKQENRHLRPRQESGGRGIGGPVRGVGGGPRMSRDEGVRNTGR 240
Query: 241 MMSRIGPGGED-GGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYA 300
M+SR GP GED GG RGR G RGRG FRGRGGFRGRGRG RTGER ERGR QDMEDGYA
Sbjct: 241 MVSRGGPDGEDGGGGRGRGGFRGRGGFRGRGGFRGRGRGPFRTGERGERGRAQDMEDGYA 300
Query: 301 AGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEP 360
AGLYLGDNADGEKLAKRIGTE+MNQLVEGFEEMS R LPSPLEEEYL +HTNYMIECEP
Sbjct: 301 AGLYLGDNADGEKLAKRIGTENMNQLVEGFEEMSGRTLPSPLEEEYLDGMHTNYMIECEP 360
Query: 361 EYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETM-QRVPLLK 420
EYLMGDFESNPDIDE PPIPLRD LE KPFLMAYENIQSHEEWEEI+EE M QRVPLLK
Sbjct: 361 EYLMGDFESNPDIDEKPPIPLRDVLEMTKPFLMAYENIQSHEEWEEIVEEIMQQRVPLLK 420
Query: 421 EIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQ 478
EIVDYYSGPDRVTAKQQQ ELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFD+KCQ
Sbjct: 421 EIVDYYSGPDRVTAKQQQEELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDRKCQ 480
BLAST of CcUC07G138480 vs. NCBI nr
Match:
XP_022980643.1 (uncharacterized protein LOC111479946 [Cucurbita maxima])
HSP 1 Score: 708.4 bits (1827), Expect = 4.4e-200
Identity = 391/488 (80.12%), Postives = 407/488 (83.40%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPF-SSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVPG 60
MSRSIGRKVPG S LSNANKL FVPF SSSSS GGHGRGRGR G P+H GPFDF+S VPG
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSGGHGRGRGRAGSPTHGGPFDFSSRVPG 60
Query: 61 QEDSNTSKQDSVGSRPTPGLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGR--VPPPI 120
QEDSN SK +SV SR T GLGH GKP+PSS ILPS SSF+PSV SSAGRGR PI
Sbjct: 61 QEDSNESKHESVDSRGTSGLGHGRGKPSPSSSILPSLSSFTPSVKSSSAGRGRGDGSQPI 120
Query: 121 RSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVG 180
RSPP SS GSDSE KKPVFFSKDNA DSA + RPGAL R VGERNLPD+ LS G G
Sbjct: 121 RSPPESRSSNGSDSERKKPVFFSKDNAADSAGSARPGALDRDVGERNLPDSFLSVLSGAG 180
Query: 181 RGKPMKQPGQEAQPKQENRHLRP------SGVGEPGRGRGGGQRMSRDGPGRNTGGMMSR 240
RGKPMKQP E+QPKQENRHLRP G PGRG GGG R+SRD RNTG MMSR
Sbjct: 181 RGKPMKQPIPESQPKQENRHLRPRQEAGGRGGSGPGRGSGGGPRISRDESVRNTGRMMSR 240
Query: 241 IGPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGLYL 300
GP GEDGG RGR G FRGRG FRGRGRG RTGER ERGR QDMEDGYA+GLYL
Sbjct: 241 GGPDGEDGGGRGRGG------FRGRGRFRGRGRGAFRTGERGERGRGQDMEDGYASGLYL 300
Query: 301 GDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYLMG 360
GDNADGEKLAKRIGTEHMNQLVEG EEMS RVLPSPLEE Y+ A+ NYMIECEPEYLMG
Sbjct: 301 GDNADGEKLAKRIGTEHMNQLVEGXEEMSGRVLPSPLEEGYVEAMDMNYMIECEPEYLMG 360
Query: 361 DFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYY 420
DFESNPDIDENPPIPLRDALEKMKPFLMAYE IQSHEEWEEI+EETMQRVPLLKEIVD Y
Sbjct: 361 DFESNPDIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVDSY 420
Query: 421 SGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLV 478
SGPDRVTAKQQQGELERVAKTLPQSAPNS+K+FTNRAVLSLQSNPGWGFDKKCQFMDKLV
Sbjct: 421 SGPDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDKLV 480
BLAST of CcUC07G138480 vs. NCBI nr
Match:
XP_023544535.1 (uncharacterized protein LOC111804080 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 708.0 bits (1826), Expect = 5.8e-200
Identity = 391/490 (79.80%), Postives = 410/490 (83.67%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPF--SSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVP 60
MSRSIGRKVPG S LSNANKL FVPF SSSSS GGHG+GRGRGG P+H GPFDF+S VP
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSSGGHGQGRGRGGSPTHGGPFDFSSRVP 60
Query: 61 GQEDSNTSKQDSVGSRPTPGLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGR--VPPP 120
GQEDSN SK +SV SR T GLGH GKP+PSS ILPS SSF+PSV SSAGRGR P
Sbjct: 61 GQEDSNESKHESVDSRGTSGLGHGRGKPSPSSSILPSLSSFTPSVKSSSAGRGRGDGSQP 120
Query: 121 IRSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGV 180
IRSPP SS GSDSE KKPVFFSKDNAGDSA + RPGAL GERNLPD+ LS G
Sbjct: 121 IRSPPESRSSNGSDSERKKPVFFSKDNAGDSAGSARPGALGGDAGERNLPDSFLSVLSGA 180
Query: 181 GRGKPMKQPGQEAQPKQENRHLRP-------SGVGEPGRGRGGGQRMSRDGPGRNTGGMM 240
GRGKPMKQP E+QPKQENRHLRP G G PGRG GGG R+SRD RNTG MM
Sbjct: 181 GRGKPMKQPIPESQPKQENRHLRPRQEAGGRGGYG-PGRGSGGGPRISRDESVRNTGRMM 240
Query: 241 SRIGPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGL 300
SR GP GEDGG RGR G RGRG G FRGRGRG RTGER +RGR QDMEDGYA+GL
Sbjct: 241 SRGGPDGEDGGGRGRGGFRGRG-----GRFRGRGRGAFRTGERGQRGRGQDMEDGYASGL 300
Query: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYL 360
YLGDNADGEKLAKRIGTEHMNQLVEGFEEMS RVLPSPLEE Y+ A+ TNYMIECEPEYL
Sbjct: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSGRVLPSPLEEGYVEAMDTNYMIECEPEYL 360
Query: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVD 420
MGDFESNPDIDENPPIPLRDALEKMKPFLMAYE I+SHEEWEEI+EETMQRVPLLKEIVD
Sbjct: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYEGIRSHEEWEEIVEETMQRVPLLKEIVD 420
Query: 421 YYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDK 478
YSGPDRVTAKQQQGELERVAKTLPQSAPNS+K+FTNRAVLSLQSNPGWGFDKKCQFMDK
Sbjct: 421 SYSGPDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDK 480
BLAST of CcUC07G138480 vs. NCBI nr
Match:
XP_022942601.1 (uncharacterized protein LOC111447586 [Cucurbita moschata])
HSP 1 Score: 705.7 bits (1820), Expect = 2.9e-199
Identity = 391/490 (79.80%), Postives = 408/490 (83.27%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPF--SSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVP 60
MSRSIGRKVPG S LSNANKL FVPF SSSSS GGHGRGRGRGG P+H GPFDF+S VP
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSSGGHGRGRGRGGSPTHGGPFDFSSRVP 60
Query: 61 GQEDSNTSKQDSVGSRPTPGL--GHGKPTPSSPILPSFSSFSPSVNPSSAGRGR--VPPP 120
GQEDSN SK +SV SR T GL GHGKP+PSS ILPS SSF+PSV S AGRGR P
Sbjct: 61 GQEDSNESKHESVDSRGTSGLGHGHGKPSPSSSILPSLSSFTPSVKSSFAGRGRGDGSQP 120
Query: 121 IRSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGV 180
IRSPP SS SDSE KPVFFSKDNAGDSA + RPGAL R VGER+LPD+ LS G
Sbjct: 121 IRSPPESRSSNESDSERTKPVFFSKDNAGDSAGSARPGALDRDVGERHLPDSFLSVLSGA 180
Query: 181 GRGKPMKQPGQEAQPKQENRHLRP-------SGVGEPGRGRGGGQRMSRDGPGRNTGGMM 240
GRGKPMKQP EAQPKQENRHLRP G G PGRG GGG R+SRD RNTG MM
Sbjct: 181 GRGKPMKQPVPEAQPKQENRHLRPRQEAGGRGGYG-PGRGSGGGPRISRDESVRNTGRMM 240
Query: 241 SRIGPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGL 300
R GP GEDGG RGR G RGRG G FRGRGRG RTGER +RGR QDMEDGYA+GL
Sbjct: 241 PRGGPDGEDGGGRGRGGFRGRG-----GRFRGRGRGAFRTGERGQRGRGQDMEDGYASGL 300
Query: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYL 360
YLGDNADGEKLAKRIGTEHMNQLVEGFEEMS RVLPSPLEE Y+ A+ TNYMIECEPEYL
Sbjct: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSGRVLPSPLEEGYVEAMDTNYMIECEPEYL 360
Query: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVD 420
MGDFESNPDIDENPPIPLRDALEKMKPFLMAYE IQSHEEWEEI+EETMQRVPLLKEIVD
Sbjct: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVD 420
Query: 421 YYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDK 478
YSGPDRVTAKQQQGELERVAKTLPQSAPNS+K+FTNRAVLSLQSNPGWGFDKKCQFMDK
Sbjct: 421 SYSGPDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDK 480
BLAST of CcUC07G138480 vs. ExPASy TrEMBL
Match:
A0A6J1C8I7 (uncharacterized protein LOC111008406 OS=Momordica charantia OX=3673 GN=LOC111008406 PE=4 SV=1)
HSP 1 Score: 723.8 bits (1867), Expect = 5.0e-205
Identity = 404/494 (81.78%), Postives = 417/494 (84.41%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPF---SSSSSFGGHGRGRGRGGFPSH--AGPFDFTS 60
MSRSIGRKVPG S L NA KLSFVPF SSSSS GGHGRGRGRGG PSH GPFDFTS
Sbjct: 1 MSRSIGRKVPGLSFLPNATKLSFVPFSSSSSSSSSGGHGRGRGRGGSPSHGGGGPFDFTS 60
Query: 61 PVPGQEDSNTSKQDSVGSRPTPGLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGRV-- 120
VPGQEDSN SKQ+SV S T GLGH GKP PSSPILPSFSSF+PSV SSAGRGRV
Sbjct: 61 RVPGQEDSNASKQESVDSPGTSGLGHGRGKPGPSSPILPSFSSFTPSVKSSSAGRGRVAG 120
Query: 121 PPPIRSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGF 180
PPIRS P GSSK SD EPKKPVFFSKDNA +SA++TR GAL RGVGERNLPD+LLS
Sbjct: 121 SPPIRSTPESGSSKQSDLEPKKPVFFSKDNAANSAASTRLGALDRGVGERNLPDSLLSVL 180
Query: 181 PGVGRGKPMKQPGQEAQPKQENRHLRP------SGVGEPGRGRGGGQRMSRDGPGRNTGG 240
GVGRGKPMKQP E QPKQENRHLRP G+G P RG GGG RMSRD RNTG
Sbjct: 181 SGVGRGKPMKQPVPEDQPKQENRHLRPRQESGGRGIGGPVRGVGGGPRMSRDEGVRNTGR 240
Query: 241 MMSRIGPGGED-GGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYA 300
M+SR GP GED GG RGR G RGRG FRGRGGFRGRGRG RTGER ERGR QDMEDGYA
Sbjct: 241 MVSRGGPDGEDGGGGRGRGGFRGRGGFRGRGGFRGRGRGPFRTGERGERGRAQDMEDGYA 300
Query: 301 AGLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEP 360
AGLYLGDNADGEKLAKRIGTE+MNQLVEGFEEMS R LPSPLEEEYL +HTNYMIECEP
Sbjct: 301 AGLYLGDNADGEKLAKRIGTENMNQLVEGFEEMSGRTLPSPLEEEYLDGMHTNYMIECEP 360
Query: 361 EYLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETM-QRVPLLK 420
EYLMGDFESNPDIDE PPIPLRD LE KPFLMAYENIQSHEEWEEI+EE M QRVPLLK
Sbjct: 361 EYLMGDFESNPDIDEKPPIPLRDVLEMTKPFLMAYENIQSHEEWEEIVEEIMQQRVPLLK 420
Query: 421 EIVDYYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQ 478
EIVDYYSGPDRVTAKQQQ ELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFD+KCQ
Sbjct: 421 EIVDYYSGPDRVTAKQQQEELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDRKCQ 480
BLAST of CcUC07G138480 vs. ExPASy TrEMBL
Match:
A0A6J1IZW9 (uncharacterized protein LOC111479946 OS=Cucurbita maxima OX=3661 GN=LOC111479946 PE=4 SV=1)
HSP 1 Score: 708.4 bits (1827), Expect = 2.2e-200
Identity = 391/488 (80.12%), Postives = 407/488 (83.40%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPF-SSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVPG 60
MSRSIGRKVPG S LSNANKL FVPF SSSSS GGHGRGRGR G P+H GPFDF+S VPG
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSGGHGRGRGRAGSPTHGGPFDFSSRVPG 60
Query: 61 QEDSNTSKQDSVGSRPTPGLGH--GKPTPSSPILPSFSSFSPSVNPSSAGRGR--VPPPI 120
QEDSN SK +SV SR T GLGH GKP+PSS ILPS SSF+PSV SSAGRGR PI
Sbjct: 61 QEDSNESKHESVDSRGTSGLGHGRGKPSPSSSILPSLSSFTPSVKSSSAGRGRGDGSQPI 120
Query: 121 RSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVG 180
RSPP SS GSDSE KKPVFFSKDNA DSA + RPGAL R VGERNLPD+ LS G G
Sbjct: 121 RSPPESRSSNGSDSERKKPVFFSKDNAADSAGSARPGALDRDVGERNLPDSFLSVLSGAG 180
Query: 181 RGKPMKQPGQEAQPKQENRHLRP------SGVGEPGRGRGGGQRMSRDGPGRNTGGMMSR 240
RGKPMKQP E+QPKQENRHLRP G PGRG GGG R+SRD RNTG MMSR
Sbjct: 181 RGKPMKQPIPESQPKQENRHLRPRQEAGGRGGSGPGRGSGGGPRISRDESVRNTGRMMSR 240
Query: 241 IGPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGLYL 300
GP GEDGG RGR G FRGRG FRGRGRG RTGER ERGR QDMEDGYA+GLYL
Sbjct: 241 GGPDGEDGGGRGRGG------FRGRGRFRGRGRGAFRTGERGERGRGQDMEDGYASGLYL 300
Query: 301 GDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYLMG 360
GDNADGEKLAKRIGTEHMNQLVEG EEMS RVLPSPLEE Y+ A+ NYMIECEPEYLMG
Sbjct: 301 GDNADGEKLAKRIGTEHMNQLVEGXEEMSGRVLPSPLEEGYVEAMDMNYMIECEPEYLMG 360
Query: 361 DFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYY 420
DFESNPDIDENPPIPLRDALEKMKPFLMAYE IQSHEEWEEI+EETMQRVPLLKEIVD Y
Sbjct: 361 DFESNPDIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVDSY 420
Query: 421 SGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLV 478
SGPDRVTAKQQQGELERVAKTLPQSAPNS+K+FTNRAVLSLQSNPGWGFDKKCQFMDKLV
Sbjct: 421 SGPDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDKLV 480
BLAST of CcUC07G138480 vs. ExPASy TrEMBL
Match:
A0A6J1FPB3 (uncharacterized protein LOC111447586 OS=Cucurbita moschata OX=3662 GN=LOC111447586 PE=4 SV=1)
HSP 1 Score: 705.7 bits (1820), Expect = 1.4e-199
Identity = 391/490 (79.80%), Postives = 408/490 (83.27%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPF--SSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVP 60
MSRSIGRKVPG S LSNANKL FVPF SSSSS GGHGRGRGRGG P+H GPFDF+S VP
Sbjct: 1 MSRSIGRKVPGLSFLSNANKLGFVPFSSSSSSSSGGHGRGRGRGGSPTHGGPFDFSSRVP 60
Query: 61 GQEDSNTSKQDSVGSRPTPGL--GHGKPTPSSPILPSFSSFSPSVNPSSAGRGR--VPPP 120
GQEDSN SK +SV SR T GL GHGKP+PSS ILPS SSF+PSV S AGRGR P
Sbjct: 61 GQEDSNESKHESVDSRGTSGLGHGHGKPSPSSSILPSLSSFTPSVKSSFAGRGRGDGSQP 120
Query: 121 IRSPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGV 180
IRSPP SS SDSE KPVFFSKDNAGDSA + RPGAL R VGER+LPD+ LS G
Sbjct: 121 IRSPPESRSSNESDSERTKPVFFSKDNAGDSAGSARPGALDRDVGERHLPDSFLSVLSGA 180
Query: 181 GRGKPMKQPGQEAQPKQENRHLRP-------SGVGEPGRGRGGGQRMSRDGPGRNTGGMM 240
GRGKPMKQP EAQPKQENRHLRP G G PGRG GGG R+SRD RNTG MM
Sbjct: 181 GRGKPMKQPVPEAQPKQENRHLRPRQEAGGRGGYG-PGRGSGGGPRISRDESVRNTGRMM 240
Query: 241 SRIGPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGL 300
R GP GEDGG RGR G RGRG G FRGRGRG RTGER +RGR QDMEDGYA+GL
Sbjct: 241 PRGGPDGEDGGGRGRGGFRGRG-----GRFRGRGRGAFRTGERGQRGRGQDMEDGYASGL 300
Query: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYL 360
YLGDNADGEKLAKRIGTEHMNQLVEGFEEMS RVLPSPLEE Y+ A+ TNYMIECEPEYL
Sbjct: 301 YLGDNADGEKLAKRIGTEHMNQLVEGFEEMSGRVLPSPLEEGYVEAMDTNYMIECEPEYL 360
Query: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVD 420
MGDFESNPDIDENPPIPLRDALEKMKPFLMAYE IQSHEEWEEI+EETMQRVPLLKEIVD
Sbjct: 361 MGDFESNPDIDENPPIPLRDALEKMKPFLMAYEGIQSHEEWEEIVEETMQRVPLLKEIVD 420
Query: 421 YYSGPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDK 478
YSGPDRVTAKQQQGELERVAKTLPQSAPNS+K+FTNRAVLSLQSNPGWGFDKKCQFMDK
Sbjct: 421 SYSGPDRVTAKQQQGELERVAKTLPQSAPNSVKKFTNRAVLSLQSNPGWGFDKKCQFMDK 480
BLAST of CcUC07G138480 vs. ExPASy TrEMBL
Match:
A0A5D3CZK6 (Translation initiation factor IF-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G001880 PE=4 SV=1)
HSP 1 Score: 661.4 bits (1705), Expect = 3.0e-186
Identity = 369/487 (75.77%), Postives = 390/487 (80.08%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPFSSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVPGQ 60
MSRSIGRKVPGFSLLSNANKL VPFSSSSS GGHGRGRGRG FPS GPFDFT PVP Q
Sbjct: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSS-GGHGRGRGRGAFPS--GPFDFTPPVPSQ 60
Query: 61 EDSNTSKQDSVGSRPTPGLGHGK--PTPSSPILPSFSSFSPSVNPSSAGRGR--VPPPIR 120
E N SKQ+ + SRPTPGLGHG+ PTPSSPI PSFSSFSPSV PSS GRGR P IR
Sbjct: 61 EHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIR 120
Query: 121 SPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVGR 180
SPP P DSEPKKPVFFS++NAGDSA++T G LHR GERNLPD+L SGF GVGR
Sbjct: 121 SPPEP------DSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGR 180
Query: 181 GKPMKQPGQEAQPKQENRHLRP------SGVGEPGRGRGGGQRMSRDGPGRNTGGMMSRI 240
GKPMKQP E QPKQENRHLRP G G GR RG R+ R P RNT M SR
Sbjct: 181 GKPMKQPVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRG 240
Query: 241 GPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGLYLG 300
GP GE GG RG SG RGRG RG FR RG RTGERW+R QD EDGYAAGLYLG
Sbjct: 241 GPDGEVGGGRGSSGYRGRGV---RGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLG 300
Query: 301 DNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYLMGD 360
+N DGE+LAK++G E MNQLVEGFEEMS RVLPSPLE+ L + N+MIECEPEYLMGD
Sbjct: 301 NNEDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGD 360
Query: 361 FESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYYS 420
FESNPDIDENPPI LRDA EKMKPFLMAYENIQSHEEWEEI+EETMQ VPL+KEIVD YS
Sbjct: 361 FESNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYS 420
Query: 421 GPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVR 478
GPDRVTAK+QQGELERVAKTLPQSAPNS+KQFTNRAVLSLQSNPGWGFDKKCQFMDKLV
Sbjct: 421 GPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVG 475
BLAST of CcUC07G138480 vs. ExPASy TrEMBL
Match:
A0A1S3BT69 (translation initiation factor IF-2 OS=Cucumis melo OX=3656 GN=LOC103492997 PE=4 SV=1)
HSP 1 Score: 661.4 bits (1705), Expect = 3.0e-186
Identity = 369/487 (75.77%), Postives = 390/487 (80.08%), Query Frame = 0
Query: 1 MSRSIGRKVPGFSLLSNANKLSFVPFSSSSSFGGHGRGRGRGGFPSHAGPFDFTSPVPGQ 60
MSRSIGRKVPGFSLLSNANKL VPFSSSSS GGHGRGRGRG FPS GPFDFT PVP Q
Sbjct: 1 MSRSIGRKVPGFSLLSNANKLGVVPFSSSSS-GGHGRGRGRGAFPS--GPFDFTPPVPSQ 60
Query: 61 EDSNTSKQDSVGSRPTPGLGHGK--PTPSSPILPSFSSFSPSVNPSSAGRGR--VPPPIR 120
E N SKQ+ + SRPTPGLGHG+ PTPSSPI PSFSSFSPSV PSS GRGR P IR
Sbjct: 61 EHPNASKQEPIDSRPTPGLGHGRGIPTPSSPIRPSFSSFSPSVRPSSVGRGRGDASPSIR 120
Query: 121 SPPGPGSSKGSDSEPKKPVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGFPGVGR 180
SPP P DSEPKKPVFFS++NAGDSA++T G LHR GERNLPD+L SGF GVGR
Sbjct: 121 SPPEP------DSEPKKPVFFSRNNAGDSAASTSLGGLHRVSGERNLPDSLHSGFSGVGR 180
Query: 181 GKPMKQPGQEAQPKQENRHLRP------SGVGEPGRGRGGGQRMSRDGPGRNTGGMMSRI 240
GKPMKQP E QPKQENRHLRP G G GR RG R+ R P RNT M SR
Sbjct: 181 GKPMKQPVPEDQPKQENRHLRPRQEGDGRGAGGRGRARGVEPRIGRGEPWRNTNRMASRG 240
Query: 241 GPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAAGLYLG 300
GP GE GG RG SG RGRG RG FR RG RTGERW+R QD EDGYAAGLYLG
Sbjct: 241 GPDGEVGGGRGSSGYRGRGV---RGPFRRGPRGSFRTGERWDRRSGQDKEDGYAAGLYLG 300
Query: 301 DNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPEYLMGD 360
+N DGE+LAK++G E MNQLVEGFEEMS RVLPSPLE+ L + N+MIECEPEYLMGD
Sbjct: 301 NNEDGERLAKKVGPEIMNQLVEGFEEMSGRVLPSPLEDRLLDGMDINFMIECEPEYLMGD 360
Query: 361 FESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEIVDYYS 420
FESNPDIDENPPI LRDA EKMKPFLMAYENIQSHEEWEEI+EETMQ VPL+KEIVD YS
Sbjct: 361 FESNPDIDENPPISLRDAFEKMKPFLMAYENIQSHEEWEEIVEETMQSVPLMKEIVDAYS 420
Query: 421 GPDRVTAKQQQGELERVAKTLPQSAPNSIKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVR 478
GPDRVTAK+QQGELERVAKTLPQSAPNS+KQFTNRAVLSLQSNPGWGFDKKCQFMDKLV
Sbjct: 421 GPDRVTAKEQQGELERVAKTLPQSAPNSVKQFTNRAVLSLQSNPGWGFDKKCQFMDKLVG 475
BLAST of CcUC07G138480 vs. TAIR 10
Match:
AT1G53645.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 294.7 bits (753), Expect = 1.4e-79
Identity = 224/552 (40.58%), Postives = 293/552 (53.08%), Query Frame = 0
Query: 1 MSRSIGRKVP---GFSLLSNANKLSFVP-----FSSSSSFGGHGRGRGR---GGFPSHAG 60
M +IGR+ GF++ S + F+ FSSSS G GRGRG GGFP+ AG
Sbjct: 1 MRSAIGRRFSNPNGFTIASLVKQTPFLTQSTSHFSSSSDSSGRGRGRGSGEDGGFPA-AG 60
Query: 61 PFDF----TSPVPGQEDSNTSKQDSVGSRPTPGLGHGKPTPSSPILPSFSSFSPSVNPSS 120
F VPG+E S+ G G G+P S I P+F+SF S +P S
Sbjct: 61 RGQFGVNREPVVPGREPSSAGGY---------GHGRGRPIQSDSISPAFTSFVKSDSP-S 120
Query: 121 AGRGR------------VPPPIRSPPGP----GSSKGSDSEPKK---------------- 180
GRGR PP +SPP P S+ S+P++
Sbjct: 121 IGRGRGSVGSDTVSPFAAEPPRQSPPPPQQQQSQSQQQRSQPQQQQPRSQPQQQPNDESQ 180
Query: 181 --PVFFSKDNAGDSASTTRPGALHRGVGERNLPDTLLSGF-------PGVGRGKPMKQPG 240
PVF D+ S+ P G+ + PD + + G GRGKP+ +
Sbjct: 181 GSPVFVKLQEMQDATSSPPPP--ESKPGQADPPDNIFNALGNEFSHPSGAGRGKPLVESA 240
Query: 241 -------------------QEAQPKQENRHLRPSGVGEPGRGRGGGQRMSRDGPGRNTGG 300
Q QP+Q+ G +P ++S + GR
Sbjct: 241 PIRQEDNRQIRRPPPPPQQQRVQPQQKRAPTVKDGTPKP--------QLSAEEAGRRARS 300
Query: 301 MMSRIGPGGEDGGDRGRSGLRGRGTFRGRGGFRGRGRGGIRTGERWERGRVQDMEDGYAA 360
+SR G +G G G RGRG RGRG RGRGRG R G+ W + ++ + A
Sbjct: 301 ELSR---GEAEGSSVGGRGGRGRG--RGRGA-RGRGRG--RGGDGWRDDKKEEEGEQEAM 360
Query: 361 GLYLGDNADGEKLAKRIGTEHMNQLVEGFEEMSVRVLPSPLEEEYLGAVHTNYMIECEPE 420
++ GD+ADGEK A+++G E M L EGFEE+ + LPS + + A TN MIECEPE
Sbjct: 361 RIFAGDSADGEKFAEKMGPELMKTLAEGFEEICEKALPSTTHDAIIDAYDTNLMIECEPE 420
Query: 421 YLMGDFESNPDIDENPPIPLRDALEKMKPFLMAYENIQSHEEWEEIMEETMQRVPLLKEI 478
Y+M DF SNPDIDE PP+ LR+ LEK+KPF++AYE I+ EEWEE + E M + PL+KEI
Sbjct: 421 YIMPDFGSNPDIDEKPPMSLRECLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEI 480
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C8I7 | 5.0e-205 | 81.78 | uncharacterized protein LOC111008406 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1IZW9 | 2.2e-200 | 80.12 | uncharacterized protein LOC111479946 OS=Cucurbita maxima OX=3661 GN=LOC111479946... | [more] |
A0A6J1FPB3 | 1.4e-199 | 79.80 | uncharacterized protein LOC111447586 OS=Cucurbita moschata OX=3662 GN=LOC1114475... | [more] |
A0A5D3CZK6 | 3.0e-186 | 75.77 | Translation initiation factor IF-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E56... | [more] |
A0A1S3BT69 | 3.0e-186 | 75.77 | translation initiation factor IF-2 OS=Cucumis melo OX=3656 GN=LOC103492997 PE=4 ... | [more] |
Match Name | E-value | Identity | Description | |
AT1G53645.1 | 1.4e-79 | 40.58 | hydroxyproline-rich glycoprotein family protein | [more] |