Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GTTGAACTTAATCCCTACGATCGAGCTTCTTCATTTTCTCCCTTGCTTTCTTCAGAAGCATATGGATTTTGCGTATGGATCATCAATCAAAGCCATGGAATCAATCGGCTTCTTCTTCTACTAGGTAATCTATGTTCTTGTTTGCTTGCTTTTGTCAAGCTTCTTCTTGTTCTATGTAGTTCTAAGTTACTAGTCACTCTTGCTGATCCATGGCGTTTGATTTTACTTTAGCTTCTCTTTCTAGGAGCTAGGGATCGATGTTAGTGAAGAGCGAAGAAATCTTATTGTTGAATTTCCTTTGACGTTATGAAGAATCTATTTCGCCTTTCGGTTGGATTTTTGCTGGTGCTGCTAGTTTTTTACTGCATCTGTGTTGATTCGAAGGTACGGAAGGTTGATCTCGATTTGTTAGGGTTGAATTTATCATTTCTGTTTAGTGTCTAGTTGGCTGTGGATTTGAATGTAGGTGGAAGAAAGTGCAAATTTGGGTCTAGATTTGAAAACAGTGAATAAAGGGAATGATGCAAGCAAGGAGAAGAAGGATGAACAACAAGTAAGCGTTTCAAAGGAGGGTGTTAAGAGCAGTGGGGATAAGATGAAGAAGGATCCTGAAAGTGAAACTGCATCTGAAGAAGGTGCTAAGGAAGTAAAGAATAAGGGGGAGAAAGAAAAGGGGAAGCCAGTGGATAATTCAGCCTCAAGGGAAGCATCTAAGAGTAGTGGGAAGGACGGCAGTACATCGAAAAGAAAGGATGGTTCTTCGGGTGAGGACTGTGATTCATCTAATAAGTGCAGTGATGAGGAAAACAAATTGGTGGCTTGCTTACGGGTTCCAGGAAATGGTAATGCATCTACAGCAATTGAATCTCAGTTTAATTTTCTTACTAATATCATGGTTCTATGAATTTTTTTTCTTGGAACGATTTCAATACTTTATCTTATTTATTTCTGGAAACGGTCTGAATATTATGTGATGATGATTTGAATTTTGAATAGGTTCCCATATATTTTTTTTAAAATGTGGGGCTGGTGGGTGGGATGAGAGTTGCATAATCTTGCCTTGATCACAGCATAACTTGAAATCAGATTGCATTCATGAGTAGGAAGGTTATATGTATTATAATTCTTGGGTAAAAAATATTTTTGGTCTCTAAAAATCTTACGTAACAATTTGATATCTAAATTTTAGGTTGTAACAATATAGTCCATATATTTTAAGTTTTGATACCTAGTGTTTTAAAAGTCGCACCCAGGCGCATGCCTAGGTCGAAGGTTAGCGCCTCGACTGTACCTGAATTTTAATTTGATGTCTAATGTGAGCTTGCCTCCAACTGAGTGACTTTTTGGTGCTGTTATGAATGTGCATCAACCAATTTATATGCAGTGACGAGTGAATTATCAAATCTGATCTCTAAAGGATTAAAGGGTTTCTGTGAACTAAGCCTAAATTAATGTTAAGAATATTAGCTGACTTTTTGTGATAAACTATTGGAATAACTATAGTGATTCAATTGGTCAAAGAGGGTCTTGGGGTGCGTAAGGTGATGATTCAATCCATGGTGGTCACCTACCTAGGAATTAATTATCCACGAGTTTCCTTGGCATCCAAATGTTGTAGGGTAAGATAGTATGCCCTGTGACATTAGAAAGTTGGCCCGAACATCTTTCAACTGAAAGTGAACCTAACATCTTTCAGCTGTATTTACCATTTCATGGTGCATTCATTGCACTATTAAGATCCCATATTGATATCATAAATCATTTCTATTTTTTAGTTTATATTTTTGTTTCAAATCTTACGTTTTTTGTATACAATTTTGCAGATTCTCCTGATTTTTCGCTTCTAATCCAGAACAAGGGAACAGGGCCTCTTACTGTGAAAATTTCTGCCCCAGATTTTGTTCAACTGGAGAAGCATGAAGTTCAACTTCAAGAGAAAGAAGATAAAAAGGTATTTAGCACACTACAATGAATACTTAATATAACATAACTTTAAATTA
TGAGAAGTTTAAGTTAGATTTTATTTTGATGGAGTGATTAATGACTAACCAATGGAAATCCCCAATCATCTATCTGATTCTCTTTCTAGATATAAATCATTTCTCGAGACTTCCTTTATACTCGGTGCTGGTTTTTTTTCTCCCTTTTATATTGCTTATCAATTTTTTATGCATCTAATTATTTTTTCTTTAGGCCTAGAAACATCCTCGTGAGCGTCCTAAGAACATTACCAGAAAGACGTCTATGAGAGTAGTTAAGACATACCCTAAATTAAGACCGTCATATGTGTTCAAAAAAAAAAAATTAAGACCGTCATCAAAAGATAATATGAGACGCTCCATCCATTTGATTCTCTCTCTAGATATAAATCATTTCTCTAGAAAGCTCTGTAGTTGCTCGCTAACTCTAGTCAAACCATACACCATCCTATTGGGAAAAAACCTTCCTTAATAACAGCATGAATAAGCAGAAAATAATTGAGAAATTAAAAGGAAATAACAAAGCAGAGGACATAAGAATTTACTTGGGAAAACTCCAAATTGGAGAAAAAAACCACGGACAGAAAGAACTTCCACTATGTGAAAAATTGCTACAATCACACATAATTCTCTCCCCGACCCCAATTACAAAAGCACTCTCTTCAAAACTTATACTACTCACACCTTTTGCCCACTTTCCACAAGAGAATACAAAAGGAATTTAACTAGAGTCAATACACAAAGTTTAAAATGCTTTTGACTGGGATGCTTTGAAACCAAAGCCTTATGCTCCTTTTATAGTGCTTGGAGCACACCACTACCTTGCATCTTTCCGATGTGGGACAAATTCACTTCTAATATTTTGCCAAAAAACCCAACAAATTCCACCTTGGCAAGATATTTGAAGAATCTACATCTTCTGCTGCAACATTGTCTTCTCTGACAATCATAATTCTCAACCCAGAGAATTATAACATACTCCACCATAAAAAGTACAACACTTGAAATCTCACATTCCAAGATTTTTCTTCTTCGTCACATGTCGATACACCTTGCTGAAATTTATGGTGCAACTTCCACCTACTTGGCTTACCTGGAAGTTCTTCAGCCCATCGACATAACTTCCACCACACACCTTGCATAACCGCCAAGCCAATGCCCGTGTGCAAACTTGTGGAACCGCTGACATCACATCCTCTACCATGGCAAAGCCGACATACCATGAGGATGATTTCTTTCTTTCTTCAGAGGGCGTCACTAGATCCCCGTACGAGAGAGACCACGTCAGCATTTGAACCAGAGCCCTTTGACGAACTTGCTCTGTTAGGACAATTGTTTCTCATGTGACCAGGTTGTCCACATTCCCAACAACTTCTCTTTTGCAAAAAGTTTTTCTTGTTCTTCCAATTGCCAACTACCAGTGCTGAATCCCCTTGTGAAGTACGCTCATCAAACTTCAGCCTTCTTTCTTCTGATAGAAGTTTACTAGTAGCCTCTGCAAAATTTAAAGACTCTTTTCCGTACATCAAGATTGGCTTCATATGTTCATAAGAAAGGGGAAGTGACAAGATAAGCCTTAGTGCCTTATCCTCGTCCTCTATCTTCACACCAATCACCTCCAGCTCAGAGAAAACACCATTGAGAACACTCAGATGATTTGAGATTTTCGTACCTTCTTCCATTCACAATGTGTAAAATTGCTCCTTCAGATAGACCTGATTTGAAATGCTCTTTGCCTGATACATCGCTTCAAGCTTCTCCCAAAGCTCTTGGCTGTTGATATTCCATGCACATTAACTAGAACATTTTTAGCCAAATTTAACCTGATGGCACTTGCAGCCCTCAAATCCATTTCCTCCCATTCGTCATCACTTATGCTGGATTTCTTAGAACCTTTGCTGGAACCACCACTGGACTCAGTTGAACCACCATCACCGCTTGATTTTTCAGAATCACCACCACTTGGTCTTCCCTTCAAAGCTTTGTGTAATCCAGATTGTATTAGCACATCTTTAACTTGC
ACTTGCCACAAGCCAAAATTCATCCTTCTATCAAATTTCTCTATATCAAACTTCACTGAACTCATAAAGCTTGCCATTTTTGCTTCAATCCTAGCACATTTGCACATAATGAACCACCCAGTTGTAACTTGTTTCAAGAATACTTTTCTGATGTGGAAGCTCAGACAAAGCTGCAACCACAGAGCATACTCAGAAATTCAAGAAACTACAAACCTAAAGCTCTGATACCACTTGTTGGGAAAAAACCTTCCCTAATAACAGCAGGAATAAGCAAAAAATAATTGAGAAATTAAAAGGAAATAACAAAGCAGAGGACACAAGAATTTACGTGGGAAAACTCCAAATTGGAGAAAAAAACCACGGACAGAAAGAACTTCTACTATGTGAAAAATTGCTACAATCACACAGAATTCTCTCCCCGACCCCAATTACAAAAACACTCTCTTCAAAACTTATACTACTCACACCCTTTGCCCACTTTCTACGAGAGAATACAAAAGGAATTTAACTAGAGTCAATACACAAAGTTTAAAATGCTTTTGACTGAGATGCTTTGAAACCAAAGTCTTATGCTCCTTTTAGTGCTTGGAGCACACCACTACCTTGCATCTTTCCGATGTGGGACAAATTCACTTCTAATATTTTGCCAAAAAACCCAACACATCCACCTTTCTGAAATGTATCGAATCTATTAATTATTGGTTGGGGAGTGTGTTTACTATTATTATTTTTGGTTATCTATGTAGTCATTGCAGATTCAATGAAGGGCTTACTATTAAGGATAAAGAGTTTTTTTTTTTATTTTTTATCGAAGATAGTAAATTTCTTTTTGGCAAGAATAGGAAGGTTCTCATCACCCTTCCCCTGGGGATATAGTAAAAGCATTTCATAGAACTGAAAGAATACAGAAATAGGAGGATAAATATAGTCTCCCGAGAGAGTCAAGTATATTAAGGATCTCACATCGGAAAGAGTGGAGACCTCAAAACAATATATAAGAGATGGGTTAAGGGGTACTTTCCTCATTGTCAATTAATTTTGAGGTGAACCTCATATACTTTAATAAAATAGAGAGAGTGAGACTTGTGTCAGACAAATGAAAGATAGGATACAGCGACAAATTTATAGAAACTTACTTTTATAGTTTCTTAAATTTTATAAAATTTTTAGAAACTATACTAAAAGAATTAAAAATAAGAAACAAGTTGGTTACTAAATATGGTTGTTTCCCATCTAAAAAAGGAAGAAATAGAAACCAGAAACAAGACAGTACCAAACGGGCCATAATAAAACCAACGTCTAGGCCTTAATTTCCATATTACTTGCGAGAAAAAAACTGCCCATCAAAAAGATTACAAAGAGAAAAGATCAAAAGATTCAAAGGTGAGAATTATAAAAGGCTTCTCCAATGAGCTTGAAGAGTGGTGTGACTATAGTTATTAAACATGGAGGAATGTTTACACCAATTCATCGCCATTATGAATAGAGGTGTTCATACATTCCCCGAAAACTAATTCAGGCGACCAAATTAGGGTTGGTCAGCTGGTTTCCAAGTATAAGTCGGTTGGGTTGGTTAGAAAACTAGTATTAGAAAATCTGTCTGTTAAGTCGGTCGGTAGGTTTACCGATATTTTTTAATATAATATTAATATTTTCATATAGTTAATCAAATTTCTTATTAAATGATTGCAAAATTAACTTAAATTTAAATGTCTTTCCGAAAATTATGGGGTATGCTACTTTTAAAAAATTAATTCTCAAAATTTCATTAAAATTGTATGGTGAATTTATTTTTGAGAAATATTTGCATCATGGTCAGGCATGATGTATAAAATAATGCCCACCCGTGTGGTTTAATGTGGGCCATAAGTTATTTCATTTTAAAGTCAAAAGATAAAAGAAATAGAATAAAAAAAAATAGATTATGGCCTATATTGAGCCACATAGGTGAACATGATTTTATACATCATGCCCGGGTCTGATGTAAATATTTCTCTTTA
TTTTTTTGATAAAAAAGGAAAAAAATATATGAATTAACCGATTTTTACCAACTTAACCGACTAAAATGATCAACTTAATCGACCACTTAACCTGTCGGTTAGTTTTGGTGTCGCTTGTAAGCAAATCGATAGGTCGGTTTAGTGTTAATTTTCACCAAAAACTGGCATTGATCGACCGATGAACACCCCTGTTTATTTTAGATGATTCTTTTCCTCAAAGATTCCGATCGGGCCCAATATTCCAACAAAATGCTAAACAAAATTTTCCCATAAATACTTATTTTCCCTTACGAAAGGATGACCGCCAAGAGAATAGGAATTAGGATAGAAGCGTACTTGGTTTCATGGGAAGCATTCTATACCAACCAAAAAATGTCAATGAATAATCCCAATCATCTCTTGGCAAATGGGCAATTGAGGAAGATACGTGTCTGAAACTTCATGTTTTGCTTACCTAATATGTGTCTGAAGCTTGTCTTGAGTATTGATGTGGCCTTGTGAGGCAATTCCCAAAAGAAGAACTTGGTCTTTTTAGGGTAGGATCCCTTTGACTCAAAAAGGGGGAAAACTTCTTTGTAAGGGGTGGGTATAAATCCATTAGAAGAGACTTTGAAAAATAAACCGCCTTCATCCAACTTCCAAAGCTCCCCATCACAGCCCTGAGTAAAAGTAAACGTGGAAAGGCAGTTTAGGAGGGTCCTAATTCTTCAATCTCGGCTTTATTCATGTAAATTATGTCTTGTAGGCTCTGTTTGATAATCATTTTGTTTTTTGTTTGTGAAATTTAAGACTATTTCTATTCAAATTTCCGACCATGTGCTCCATCTTTCCTACAATGTATCCATCATTCCTTAAGAAAGTAGGAGAATATTAACCAAATTTCAAAAACATAAACAAGTTTTGGGAAGCTACTTTTTTGTTTACAAATTTTGGTTTTGTTTATTAAAATGTAGGTTAGATGTAGATATCCAAATAAGGAAAAAAGTATAGTAAGGTAGTTGTTGTAGACTTAAATTTCATAAACAAAAAACGAAAAACAAAATGGTTATCAAATGGGGCCGTAATAATATAGAATAGCTGCTAATTATAAGATTCACGTGGTAAAAGTTTCATCATAATATACACATCCAAATAATCACATTCTGCAGGTAAAATTTTCAATCGGTGCTGAACAAGGTGACAACACGATCGTCGTTCTTTCCGCAGGTAGCGGCCATTGTAATCTTGATTTCAGGAATCTGATTATGCCCAATGTTGGCAAAGATTCTGATAATCTTCCCAAGTCCTCACGGTTCAGCTACCTCTCAAAACCACCTGTTATTGCAATTTTAGCTTTTGCTATAATTCTGACATTTGCAGCTGCTCTAGTTTTCATTAGCATTCGTCGTAAGATTTCTGTAAGTAGCAATTCAAAATATCAAAGGTTGGACATGGAGCTCCCTGTTTCCATTGGAGGTAAATCTGCTGCTGACAATAACGACGGATGGGAAAACAGTTGGGATGACAATTGGGACGACGAGGCACCGCACACACCGTCCCTGCCTCTTACTCCAAATCTTTCGTCGAAGGGCCTTGCCTCTCGACGGTTGAACAAGGAAGGCTGGAAAGATTAGTCACACCATTAGCTTCACAAGCACACATACATAAACATAGAACAAGCCACTCAAAAGCCAATCCAATTATCTGGGAGTTTTGGTCAGGATTTCATCATACAAGGTACACCCCTCTACCCTTTTTTTAAGTACAGCCCTGTTAATTTAAACCAACAAGCATATGTTTGTCATTTAGTATTTGCATCCAACTTCTGTAATTATGATCTCCTCTTTATTTTTCACAATTGGATTGGGAGGGATGTGGAGAATCAGAACCTCATAGATATTTACTTCTGGTAGAATAGTTTTTACAATTTCTATTCTCGATTTTTTCAAATATCCGTGAAGCACAATTGGACTATTCTCTTGGGATGACTATCGACTCCATTTCATTTCGTTGTCGATGCA
ACTTTAAGGGTGTGTTTGTCTTTGGTAAGAAAGTGGTAAACAATCACCCTTCTAAGATTAGTGTTTAATTGTTTCGGATTTGGCAGTAAACAACCATCCTTAACCATCCATCAAATCTTTCATCAAGTCTCACATCAAATCCTACATCAAATTCCACAACAAACTTATCATTCAACCACTACTAACATTGATGTCCAAATGAATAGTTGATTACCTCAGTCATTACAAACACTAGTCTCCAAATAAGTATTTAATTACCTCAACCATATAAACGAGTGACTAATTACCCCAACTACCATTAACCCTTCCACTCTATTCTTTTTTACCAAATTCAAATACACCCATGAAATTATTCTGTAGGTAAATATACCACAATAATTTATTCATTCTACACATTTATTATTTTCAAGGTTTACTCGATAGTTTTTTTCGAGCAATACGTACCTAAGGGTGGGCAAATATTTTGAAAATACTTGCCCACCTTAAAATCACATATTTTTATTTTTTTTATTATTTTTTTTAAATTGACTTTAATTTATGGAGAGAGAAATGTTAAAATGGAGAGAG
mRNA sequence
GTTGAACTTAATCCCTACGATCGAGCTTCTTCATTTTCTCCCTTGCTTTCTTCAGAAGCATATGGATTTTGCGTATGGATCATCAATCAAAGCCATGGAATCAATCGGCTTCTTCTTCTACTAGGAGCTAGGGATCGATGTTAGTGAAGAGCGAAGAAATCTTATTGTTGAATTTCCTTTGACGTTATGAAGAATCTATTTCGCCTTTCGGTTGGATTTTTGCTGGTGCTGCTAGTTTTTTACTGCATCTGTGTTGATTCGAAGTGTCTAGTTGGCTGTGGATTTGAATGTAGGTGGAAGAAAGTGCAAATTTGGGTCTAGATTTGAAAACAGTGAATAAAGGGAATGATGCAAGCAAGGAGAAGAAGGATGAACAACAAGTAAGCGTTTCAAAGGAGGGTGTTAAGAGCAGTGGGGATAAGATGAAGAAGGATCCTGAAAGTGAAACTGCATCTGAAGAAGGTGCTAAGGAAGTAAAGAATAAGGGGGAGAAAGAAAAGGGGAAGCCAGTGGATAATTCAGCCTCAAGGGAAGCATCTAAGAGTAGTGGGAAGGACGGCAGTACATCGAAAAGAAAGGATGGTTCTTCGGGTGAGGACTGTGATTCATCTAATAAGTGCAGTGATGAGGAAAACAAATTGGTGGCTTGCTTACGGGTTCCAGGAAATGATTCTCCTGATTTTTCGCTTCTAATCCAGAACAAGGGAACAGGGCCTCTTACTGTGAAAATTTCTGCCCCAGATTTTGTTCAACTGGAGAAGCATGAAGTTCAACTTCAAGAGAAAGAAGATAAAAAGGTAAAATTTTCAATCGGTGCTGAACAAGGTGACAACACGATCGTCGTTCTTTCCGCAGGTAGCGGCCATTGTAATCTTGATTTCAGGAATCTGATTATGCCCAATGTTGGCAAAGATTCTGATAATCTTCCCAAGTCCTCACGGTTCAGCTACCTCTCAAAACCACCTGTTATTGCAATTTTAGCTTTTGCTATAATTCTGACATTTGCAGCTGCTCTAGTTTTCATTAGCATTCGTCGTAAGATTTCTGTAAGTAGCAATTCAAAATATCAAAGGTTGGACATGGAGCTCCCTGTTTCCATTGGAGGTAAATCTGCTGCTGACAATAACGACGGATGGGAAAACAGTTGGGATGACAATTGGGACGACGAGGCACCGCACACACCGTCCCTGCCTCTTACTCCAAATCTTTCGTCGAAGGGCCTTGCCTCTCGACGGTTGAACAAGGAAGGCTGGAAAGATTAGTCACACCATTAGCTTCACAAGCACACATACATAAACATAGAACAAGCCACTCAAAAGCCAATCCAATTATCTGGGAGTTTTGGTCAGGATTTCATCATACAAGGTACACCCCTCTACCCTTTTTTTAAGTACAGCCCTGTTAATTTAAACCAACAAGCATATGTTTGTCATTTAGTATTTGCATCCAACTTCTGTAATTATGATCTCCTCTTTATTTTTCACAATTGGATTGGGAGGGATGTGGAGAATCAGAACCTCATAGATATTTACTTCTGGTAGAATAGTTTTTACAATTTCTATTCTCGATTTTTTCAAATATCCGTGAAGCACAATTGGACTATTCTCTTGGGATGACTATCGACTCCATTTCATTTCGTTGTCGATGCAACTTTAAGGGTGTGTTTGTCTTTGGTAAGAAAGTGGTAAACAATCACCCTTCTAAGATTAGTGTTTAATTGTTTCGGATTTGGCAGTAAACAACCATCCTTAACCATCCATCAAATCTTTCATCAAGTCTCACATCAAATCCTACATCAAATTCCACAACAAACTTATCATTCAACCACTACTAACATTGATGTCCAAATGAATAGTTGATTACCTCAGTCATTACAAACACTAGTCTCCAAATAAGTATTTAATTACCTCAACCATATAAACGAGTGACTAATTACCCCAACTACCATTAACCCTTCCACTCTATTCTTTTTTACCAAATTCAAATACACCCATGAAATT
ATTCTGTAGGTAAATATACCACAATAATTTATTCATTCTACACATTTATTATTTTCAAGGTTTACTCGATAGTTTTTTTCGAGCAATACGTACCTAAGGGTGGGCAAATATTTTGAAAATACTTGCCCACCTTAAAATCACATATTTTTATTTTTTTTATTATTTTTTTTAAATTGACTTTAATTTATGGAGAGAGAAATGTTAAAATGGAGAGAG
Coding sequence (CDS)
ATGAAGAAGGATCCTGAAAGTGAAACTGCATCTGAAGAAGGTGCTAAGGAAGTAAAGAATAAGGGGGAGAAAGAAAAGGGGAAGCCAGTGGATAATTCAGCCTCAAGGGAAGCATCTAAGAGTAGTGGGAAGGACGGCAGTACATCGAAAAGAAAGGATGGTTCTTCGGGTGAGGACTGTGATTCATCTAATAAGTGCAGTGATGAGGAAAACAAATTGGTGGCTTGCTTACGGGTTCCAGGAAATGATTCTCCTGATTTTTCGCTTCTAATCCAGAACAAGGGAACAGGGCCTCTTACTGTGAAAATTTCTGCCCCAGATTTTGTTCAACTGGAGAAGCATGAAGTTCAACTTCAAGAGAAAGAAGATAAAAAGGTAAAATTTTCAATCGGTGCTGAACAAGGTGACAACACGATCGTCGTTCTTTCCGCAGGTAGCGGCCATTGTAATCTTGATTTCAGGAATCTGATTATGCCCAATGTTGGCAAAGATTCTGATAATCTTCCCAAGTCCTCACGGTTCAGCTACCTCTCAAAACCACCTGTTATTGCAATTTTAGCTTTTGCTATAATTCTGACATTTGCAGCTGCTCTAGTTTTCATTAGCATTCGTCGTAAGATTTCTGTAAGTAGCAATTCAAAATATCAAAGGTTGGACATGGAGCTCCCTGTTTCCATTGGAGGTAAATCTGCTGCTGACAATAACGACGGATGGGAAAACAGTTGGGATGACAATTGGGACGACGAGGCACCGCACACACCGTCCCTGCCTCTTACTCCAAATCTTTCGTCGAAGGGCCTTGCCTCTCGACGGTTGAACAAGGAAGGCTGGAAAGATTAG
Protein sequence
MKKDPESETASEEGAKEVKNKGEKEKGKPVDNSASREASKSSGKDGSTSKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAPDFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSDNLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSIGGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD
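The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this transcript; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database. Only the first 30 and last 18 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this transcript).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS listed above.
cds_start = "ATGAAGAAGGATCCTGAAAGTGAAACTGCA"
cds_end = "GAAGGCTGGAAAGATTAG"

print(translate(cds_start))  # MKKDPESETA
print(translate(cds_end))    # EGWKD
```

The two printed fragments correspond to the first ten and last five residues of the protein sequence; passing the full CDS string to `translate` should reproduce the whole polypeptide under the same assumption.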
Homology
BLAST of Sed0007649.4 vs. NCBI nr
Match:
XP_038883447.1 (uncharacterized protein LOC120074402 [Benincasa hispida])
HSP 1 Score: 422.9 bits (1086), Expect = 2.1e-114
Identity = 230/293 (78.50%), Positives = 250/293 (85.32%), Query Frame = 0
Query: 1 MKKDPESETASEEGAKEVK----------NKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKDPES+T SEEGA +VK NKGEKEKGKPVDNS S+EASKSSGK ST
Sbjct: 85 IKKDPESKTISEEGANKVKKDGGLGEEGRNKGEKEKGKPVDNSVSKEASKSSGKGESTVS 144
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SKRKDGSSGEDCDSSNKC+DE NKLVACLRVPGNDSP SLLIQNKGTGPLTVKISAP
Sbjct: 145 SESKRKDGSSGEDCDSSNKCTDEGNKLVACLRVPGNDSPQLSLLIQNKGTGPLTVKISAP 204
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DF+ LEK EVQLQEKEDKKVK SIG + GD ++L+AGSG C+LDFR+LI+ N KDSD
Sbjct: 205 DFIHLEKSEVQLQEKEDKKVKVSIG-DGGDGNAIILTAGSGRCSLDFRDLIVHNNAKDSD 264
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
N+ KSSRFSYL+KP +IAILAFA+ILT AAA VFISIRRK SSNSKYQRLDMELPVSI
Sbjct: 265 NVSKSSRFSYLTKPHIIAILAFAVILTIAAASVFISIRRKNFASSNSKYQRLDMELPVSI 324
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
GGKS ADNNDGWENSWDDNWDDE PHTPSLP+TP+LSSKGLASRRLNKEGW+D
Sbjct: 325 GGKSVADNNDGWENSWDDNWDDETPHTPSLPVTPSLSSKGLASRRLNKEGWRD 376
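The identity and positives percentages in an HSP header are ratios of matching columns to aligned columns (gaps included). A small sketch of that arithmetic, using the counts reported for the HSPs on this page; the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_columns):
    """Return matches / aligned_columns as a percentage with two decimals."""
    return f"{100 * matches / aligned_columns:.2f}"


# Counts taken from HSP headers on this page.
print(percent(230, 293))  # identity for XP_038883447.1  -> 78.50
print(percent(250, 293))  # positives for XP_038883447.1 -> 85.32
print(percent(122, 289))  # identity for AT1G64385.1     -> 42.21
```

The same calculation applies to every HSP listed here, since each header reports its counts over the alignment length.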
BLAST of Sed0007649.4 vs. NCBI nr
Match:
XP_022942101.1 (uncharacterized protein LOC111447272 isoform X2 [Cucurbita moschata])
HSP 1 Score: 417.5 bits (1072), Expect = 8.8e-113
Identity = 230/293 (78.50%), Positives = 246/293 (83.96%), Query Frame = 0
Query: 1 MKKDPESETASEEGA----------KEVKNKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKD ESET SEEGA KEVKNKGEKEKGKPVDNS S+E KSSGKDGST
Sbjct: 82 IKKDHESETVSEEGANEAKKDGDLRKEVKNKGEKEKGKPVDNSVSKEKFKSSGKDGSTES 141
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SK KD SSGEDCDSSNKC+DE NKLVACLRVPGN+SPD SLLIQNKGTGPLTVKI+AP
Sbjct: 142 SASKGKDISSGEDCDSSNKCTDEGNKLVACLRVPGNESPDLSLLIQNKGTGPLTVKITAP 201
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EV+LQEKEDKKVK SIG + GD +VL+ GSGHCNLDFR+LI N K SD
Sbjct: 202 DFVHLEKSEVKLQEKEDKKVKVSIG-DGGDGHTIVLTTGSGHCNLDFRDLISHNNAKVSD 261
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
NLPKSSRFSYL+KP VIAILAFA+ILTFAA +VFISIR K S NSKYQRLDMELPVSI
Sbjct: 262 NLPKSSRFSYLTKPHVIAILAFAVILTFAATVVFISIRSKSFTSGNSKYQRLDMELPVSI 321
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
G+S ADNNDGWENSWDDNWDDE PHTP+LP+TPNLSSKGLASRRLNKEGWKD
Sbjct: 322 TGQSVADNNDGWENSWDDNWDDETPHTPTLPVTPNLSSKGLASRRLNKEGWKD 373
BLAST of Sed0007649.4 vs. NCBI nr
Match:
XP_022942100.1 (uncharacterized protein LOC111447272 isoform X1 [Cucurbita moschata])
HSP 1 Score: 417.5 bits (1072), Expect = 8.8e-113
Identity = 230/293 (78.50%), Positives = 246/293 (83.96%), Query Frame = 0
Query: 1 MKKDPESETASEEGA----------KEVKNKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKD ESET SEEGA KEVKNKGEKEKGKPVDNS S+E KSSGKDGST
Sbjct: 85 IKKDHESETVSEEGANEAKKDGDLRKEVKNKGEKEKGKPVDNSVSKEKFKSSGKDGSTES 144
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SK KD SSGEDCDSSNKC+DE NKLVACLRVPGN+SPD SLLIQNKGTGPLTVKI+AP
Sbjct: 145 SASKGKDISSGEDCDSSNKCTDEGNKLVACLRVPGNESPDLSLLIQNKGTGPLTVKITAP 204
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EV+LQEKEDKKVK SIG + GD +VL+ GSGHCNLDFR+LI N K SD
Sbjct: 205 DFVHLEKSEVKLQEKEDKKVKVSIG-DGGDGHTIVLTTGSGHCNLDFRDLISHNNAKVSD 264
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
NLPKSSRFSYL+KP VIAILAFA+ILTFAA +VFISIR K S NSKYQRLDMELPVSI
Sbjct: 265 NLPKSSRFSYLTKPHVIAILAFAVILTFAATVVFISIRSKSFTSGNSKYQRLDMELPVSI 324
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
G+S ADNNDGWENSWDDNWDDE PHTP+LP+TPNLSSKGLASRRLNKEGWKD
Sbjct: 325 TGQSVADNNDGWENSWDDNWDDETPHTPTLPVTPNLSSKGLASRRLNKEGWKD 376
BLAST of Sed0007649.4 vs. NCBI nr
Match:
XP_022981480.1 (uncharacterized protein LOC111480580 isoform X2 [Cucurbita maxima])
HSP 1 Score: 415.2 bits (1066), Expect = 4.4e-112
Identity = 229/293 (78.16%), Positives = 245/293 (83.62%), Query Frame = 0
Query: 1 MKKDPESETASEEGA----------KEVKNKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKD ESET SE GA KEVKNKGEKEKGKPVDNS S+E KSSGKDGST
Sbjct: 82 IKKDHESETVSEGGANEVKKDGDLRKEVKNKGEKEKGKPVDNSVSKEKFKSSGKDGSTES 141
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SK KD SSGEDCDSSNKC+DE NKLVACLRVPGN+SPD SLLIQNKGTGPLTVKI+AP
Sbjct: 142 SASKGKDISSGEDCDSSNKCTDEGNKLVACLRVPGNESPDLSLLIQNKGTGPLTVKITAP 201
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EV+LQEKEDKKVK SIG + GD +VL+ GSG CNLDFR+LI N K SD
Sbjct: 202 DFVHLEKSEVKLQEKEDKKVKVSIG-DGGDGHTIVLTTGSGRCNLDFRDLISHNNAKVSD 261
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
NLPKSSRFSYL+KPPVIAILAFA+ILTFAA +VFISIR K S NSKYQRLDMELPVSI
Sbjct: 262 NLPKSSRFSYLTKPPVIAILAFAVILTFAATVVFISIRSKSFTSGNSKYQRLDMELPVSI 321
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
G+S ADNNDGWENSWDDNWDDE PHTP+LP+TPNLSSKGLASRRLNKEGWKD
Sbjct: 322 TGQSVADNNDGWENSWDDNWDDETPHTPTLPVTPNLSSKGLASRRLNKEGWKD 373
BLAST of Sed0007649.4 vs. NCBI nr
Match:
XP_022981471.1 (uncharacterized protein LOC111480580 isoform X1 [Cucurbita maxima])
HSP 1 Score: 415.2 bits (1066), Expect = 4.4e-112
Identity = 229/293 (78.16%), Positives = 245/293 (83.62%), Query Frame = 0
Query: 1 MKKDPESETASEEGA----------KEVKNKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKD ESET SE GA KEVKNKGEKEKGKPVDNS S+E KSSGKDGST
Sbjct: 85 IKKDHESETVSEGGANEVKKDGDLRKEVKNKGEKEKGKPVDNSVSKEKFKSSGKDGSTES 144
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SK KD SSGEDCDSSNKC+DE NKLVACLRVPGN+SPD SLLIQNKGTGPLTVKI+AP
Sbjct: 145 SASKGKDISSGEDCDSSNKCTDEGNKLVACLRVPGNESPDLSLLIQNKGTGPLTVKITAP 204
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EV+LQEKEDKKVK SIG + GD +VL+ GSG CNLDFR+LI N K SD
Sbjct: 205 DFVHLEKSEVKLQEKEDKKVKVSIG-DGGDGHTIVLTTGSGRCNLDFRDLISHNNAKVSD 264
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
NLPKSSRFSYL+KPPVIAILAFA+ILTFAA +VFISIR K S NSKYQRLDMELPVSI
Sbjct: 265 NLPKSSRFSYLTKPPVIAILAFAVILTFAATVVFISIRSKSFTSGNSKYQRLDMELPVSI 324
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
G+S ADNNDGWENSWDDNWDDE PHTP+LP+TPNLSSKGLASRRLNKEGWKD
Sbjct: 325 TGQSVADNNDGWENSWDDNWDDETPHTPTLPVTPNLSSKGLASRRLNKEGWKD 376
BLAST of Sed0007649.4 vs. ExPASy TrEMBL
Match:
A0A6J1FVL0 (uncharacterized protein LOC111447272 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447272 PE=4 SV=1)
HSP 1 Score: 417.5 bits (1072), Expect = 4.3e-113
Identity = 230/293 (78.50%), Positives = 246/293 (83.96%), Query Frame = 0
Query: 1 MKKDPESETASEEGA----------KEVKNKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKD ESET SEEGA KEVKNKGEKEKGKPVDNS S+E KSSGKDGST
Sbjct: 85 IKKDHESETVSEEGANEAKKDGDLRKEVKNKGEKEKGKPVDNSVSKEKFKSSGKDGSTES 144
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SK KD SSGEDCDSSNKC+DE NKLVACLRVPGN+SPD SLLIQNKGTGPLTVKI+AP
Sbjct: 145 SASKGKDISSGEDCDSSNKCTDEGNKLVACLRVPGNESPDLSLLIQNKGTGPLTVKITAP 204
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EV+LQEKEDKKVK SIG + GD +VL+ GSGHCNLDFR+LI N K SD
Sbjct: 205 DFVHLEKSEVKLQEKEDKKVKVSIG-DGGDGHTIVLTTGSGHCNLDFRDLISHNNAKVSD 264
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
NLPKSSRFSYL+KP VIAILAFA+ILTFAA +VFISIR K S NSKYQRLDMELPVSI
Sbjct: 265 NLPKSSRFSYLTKPHVIAILAFAVILTFAATVVFISIRSKSFTSGNSKYQRLDMELPVSI 324
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
G+S ADNNDGWENSWDDNWDDE PHTP+LP+TPNLSSKGLASRRLNKEGWKD
Sbjct: 325 TGQSVADNNDGWENSWDDNWDDETPHTPTLPVTPNLSSKGLASRRLNKEGWKD 376
BLAST of Sed0007649.4 vs. ExPASy TrEMBL
Match:
A0A6J1FMW7 (uncharacterized protein LOC111447272 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111447272 PE=4 SV=1)
HSP 1 Score: 417.5 bits (1072), Expect = 4.3e-113
Identity = 230/293 (78.50%), Positives = 246/293 (83.96%), Query Frame = 0
Query: 1 MKKDPESETASEEGA----------KEVKNKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKD ESET SEEGA KEVKNKGEKEKGKPVDNS S+E KSSGKDGST
Sbjct: 82 IKKDHESETVSEEGANEAKKDGDLRKEVKNKGEKEKGKPVDNSVSKEKFKSSGKDGSTES 141
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SK KD SSGEDCDSSNKC+DE NKLVACLRVPGN+SPD SLLIQNKGTGPLTVKI+AP
Sbjct: 142 SASKGKDISSGEDCDSSNKCTDEGNKLVACLRVPGNESPDLSLLIQNKGTGPLTVKITAP 201
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EV+LQEKEDKKVK SIG + GD +VL+ GSGHCNLDFR+LI N K SD
Sbjct: 202 DFVHLEKSEVKLQEKEDKKVKVSIG-DGGDGHTIVLTTGSGHCNLDFRDLISHNNAKVSD 261
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
NLPKSSRFSYL+KP VIAILAFA+ILTFAA +VFISIR K S NSKYQRLDMELPVSI
Sbjct: 262 NLPKSSRFSYLTKPHVIAILAFAVILTFAATVVFISIRSKSFTSGNSKYQRLDMELPVSI 321
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
G+S ADNNDGWENSWDDNWDDE PHTP+LP+TPNLSSKGLASRRLNKEGWKD
Sbjct: 322 TGQSVADNNDGWENSWDDNWDDETPHTPTLPVTPNLSSKGLASRRLNKEGWKD 373
BLAST of Sed0007649.4 vs. ExPASy TrEMBL
Match:
A0A6J1IZL6 (uncharacterized protein LOC111480580 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111480580 PE=4 SV=1)
HSP 1 Score: 415.2 bits (1066), Expect = 2.1e-112
Identity = 229/293 (78.16%), Positives = 245/293 (83.62%), Query Frame = 0
Query: 1 MKKDPESETASEEGA----------KEVKNKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKD ESET SE GA KEVKNKGEKEKGKPVDNS S+E KSSGKDGST
Sbjct: 82 IKKDHESETVSEGGANEVKKDGDLRKEVKNKGEKEKGKPVDNSVSKEKFKSSGKDGSTES 141
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SK KD SSGEDCDSSNKC+DE NKLVACLRVPGN+SPD SLLIQNKGTGPLTVKI+AP
Sbjct: 142 SASKGKDISSGEDCDSSNKCTDEGNKLVACLRVPGNESPDLSLLIQNKGTGPLTVKITAP 201
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EV+LQEKEDKKVK SIG + GD +VL+ GSG CNLDFR+LI N K SD
Sbjct: 202 DFVHLEKSEVKLQEKEDKKVKVSIG-DGGDGHTIVLTTGSGRCNLDFRDLISHNNAKVSD 261
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
NLPKSSRFSYL+KPPVIAILAFA+ILTFAA +VFISIR K S NSKYQRLDMELPVSI
Sbjct: 262 NLPKSSRFSYLTKPPVIAILAFAVILTFAATVVFISIRSKSFTSGNSKYQRLDMELPVSI 321
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
G+S ADNNDGWENSWDDNWDDE PHTP+LP+TPNLSSKGLASRRLNKEGWKD
Sbjct: 322 TGQSVADNNDGWENSWDDNWDDETPHTPTLPVTPNLSSKGLASRRLNKEGWKD 373
BLAST of Sed0007649.4 vs. ExPASy TrEMBL
Match:
A0A6J1IWN0 (uncharacterized protein LOC111480580 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111480580 PE=4 SV=1)
HSP 1 Score: 415.2 bits (1066), Expect = 2.1e-112
Identity = 229/293 (78.16%), Positives = 245/293 (83.62%), Query Frame = 0
Query: 1 MKKDPESETASEEGA----------KEVKNKGEKEKGKPVDNSASREASKSSGKDGST-- 60
+KKD ESET SE GA KEVKNKGEKEKGKPVDNS S+E KSSGKDGST
Sbjct: 85 IKKDHESETVSEGGANEVKKDGDLRKEVKNKGEKEKGKPVDNSVSKEKFKSSGKDGSTES 144
Query: 61 --SKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
SK KD SSGEDCDSSNKC+DE NKLVACLRVPGN+SPD SLLIQNKGTGPLTVKI+AP
Sbjct: 145 SASKGKDISSGEDCDSSNKCTDEGNKLVACLRVPGNESPDLSLLIQNKGTGPLTVKITAP 204
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EV+LQEKEDKKVK SIG + GD +VL+ GSG CNLDFR+LI N K SD
Sbjct: 205 DFVHLEKSEVKLQEKEDKKVKVSIG-DGGDGHTIVLTTGSGRCNLDFRDLISHNNAKVSD 264
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
NLPKSSRFSYL+KPPVIAILAFA+ILTFAA +VFISIR K S NSKYQRLDMELPVSI
Sbjct: 265 NLPKSSRFSYLTKPPVIAILAFAVILTFAATVVFISIRSKSFTSGNSKYQRLDMELPVSI 324
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
G+S ADNNDGWENSWDDNWDDE PHTP+LP+TPNLSSKGLASRRLNKEGWKD
Sbjct: 325 TGQSVADNNDGWENSWDDNWDDETPHTPTLPVTPNLSSKGLASRRLNKEGWKD 376
BLAST of Sed0007649.4 vs. ExPASy TrEMBL
Match:
A0A5D3CXF4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21G003830 PE=4 SV=1)
HSP 1 Score: 412.9 bits (1060), Expect = 1.0e-111
Identity = 223/293 (76.11%), Positives = 246/293 (83.96%), Query Frame = 0
Query: 1 MKKDPESETASEEGAKEVK----------NKGEKEKGKPVDNSASREASKSSGKD----G 60
+KKDPESET S+EGA +VK NKGEKEKGKPVDNS S+E SKSSGK
Sbjct: 85 IKKDPESETVSKEGADKVKKDDGIGEEGRNKGEKEKGKPVDNSVSKEGSKSSGKGESTVS 144
Query: 61 STSKRKDGSSGEDCDSSNKCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAP 120
STSKR DGSSGEDCDSSNKC+DE +LVACLRVPGNDSP SLLIQNKG GPLTVKISAP
Sbjct: 145 STSKRNDGSSGEDCDSSNKCTDEAKRLVACLRVPGNDSPQLSLLIQNKGKGPLTVKISAP 204
Query: 121 DFVQLEKHEVQLQEKEDKKVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSD 180
DFV LEK EVQLQE+EDKKVK SIG + GD + ++L+AGSGHC+LDFR+LI N KDSD
Sbjct: 205 DFVHLEKSEVQLQEEEDKKVKVSIG-DGGDGSTIILTAGSGHCSLDFRDLIAHNNAKDSD 264
Query: 181 NLPKSSRFSYLSKPPVIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVSI 240
N+PKSS FSYL+KP VIAILAF +ILT AA +FI+IRRK VSSNSKYQRLDMELPVS+
Sbjct: 265 NVPKSSWFSYLTKPHVIAILAFGVILTIAAVSLFITIRRKNFVSSNSKYQRLDMELPVSL 324
Query: 241 GGKSAADNNDGWENSWDDNWDDEAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
GGK+ ADNNDGWENSWDDNWDDE PHTPSLP+TPNLSSKGLASRRLNK+GWKD
Sbjct: 325 GGKAVADNNDGWENSWDDNWDDETPHTPSLPVTPNLSSKGLASRRLNKDGWKD 376
BLAST of Sed0007649.4 vs. TAIR 10
Match:
AT1G64385.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; Has 66 Blast hits to 66 proteins in 21 species: Archae - 0; Bacteria - 0; Metazoa - 6; Fungi - 4; Plants - 51; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink). )
HSP 1 Score: 191.8 bits (486), Expect = 7.3e-49
Identity = 122/289 (42.21%), Positives = 177/289 (61.25%), Query Frame = 0
Query: 6 ESETASEEGAKEVKNKGEK-EKGKPVDNSASREASKSSGKDGSTSKRKDGSSGEDCDSSN 65
+ +T +G+K + + K ++GK + + +E ++ K ++S++K G GE+CD SN
Sbjct: 66 DDDTQLGDGSKMIGSDSSKSDQGKIASDESDKEEEEAVSK--NSSRKKQGFHGEECDPSN 125
Query: 66 KCSDEENKLVACLRVPGNDSPDFSLLIQNKGTGPLTVKISAPDFVQLEKHEVQLQEKEDK 125
C D+E++ ACLRVPGND+P SLLIQNKG L V I+AP FV+LEK +VQL + ED
Sbjct: 126 MCIDDEHEFSACLRVPGNDAPHLSLLIQNKGKRALIVTITAPVFVRLEKDKVQLLQNEDI 185
Query: 126 KVKFSIGAEQGDNTIVVLSAGSGHCNLDFRNLIMPNVGKDSDNLPKSSRFSYL---SKPP 185
KVK SI +++ +VL++ G C L+ ++L +SD+ SR S L S+
Sbjct: 186 KVKVSIKKGGSNDSAIVLASSKGRCRLELKDLAAAAHETESDDTVSVSRPSILNISSRTL 245
Query: 186 VIAILAFAIILTFAAALVFISIRRKISVSSNSKYQRLDMELPVS----IGGKSAADNNDG 245
++ I+ ++L+ V I + + S N+KYQRLDMELPVS + +DG
Sbjct: 246 IVIIMISFLVLSLVIIPVIIHVYKNKS-RGNNKYQRLDMELPVSNPALVTKSDQESGDDG 305
Query: 246 WENSWDDNWDD-------EAPHTPSLPLTPNLSSKGLASRRLNKEGWKD 280
W N+W D+WDD E P+TP LPLTP+LSS+GLA RRL+KEGWKD
Sbjct: 306 WNNNWGDDWDDENGGGDEEQPNTPVLPLTPSLSSRGLAPRRLSKEGWKD 351
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description |
XP_038883447.1 | 2.1e-114 | 78.50 | uncharacterized protein LOC120074402 [Benincasa hispida] |
XP_022942101.1 | 8.8e-113 | 78.50 | uncharacterized protein LOC111447272 isoform X2 [Cucurbita moschata] |
XP_022942100.1 | 8.8e-113 | 78.50 | uncharacterized protein LOC111447272 isoform X1 [Cucurbita moschata] |
XP_022981480.1 | 4.4e-112 | 78.16 | uncharacterized protein LOC111480580 isoform X2 [Cucurbita maxima] |
XP_022981471.1 | 4.4e-112 | 78.16 | uncharacterized protein LOC111480580 isoform X1 [Cucurbita maxima] |
Match Name | E-value | Identity (%) | Description |
A0A6J1FVL0 | 4.3e-113 | 78.50 | uncharacterized protein LOC111447272 isoform X1 OS=Cucurbita moschata OX=3662 GN... |
A0A6J1FMW7 | 4.3e-113 | 78.50 | uncharacterized protein LOC111447272 isoform X2 OS=Cucurbita moschata OX=3662 GN... |
A0A6J1IZL6 | 2.1e-112 | 78.16 | uncharacterized protein LOC111480580 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... |
A0A6J1IWN0 | 2.1e-112 | 78.16 | uncharacterized protein LOC111480580 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... |
A0A5D3CXF4 | 1.0e-111 | 76.11 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... |
Match Name | E-value | Identity (%) | Description |
AT1G64385.1 | 7.3e-49 | 42.21 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... |
Relationships
This mRNA is a part of the following gene feature(s):
Feature Name | Unique Name | Type |
Sed0007649 | Sed0007649 | gene |
The following five_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0007649.4-five_prime_utr | Sed0007649.4-five_prime_utr-LG07:12237423..12237546 | five_prime_UTR |
Sed0007649.4-five_prime_utr | Sed0007649.4-five_prime_utr-LG07:12237667..12237806 | five_prime_UTR |
Sed0007649.4-five_prime_utr | Sed0007649.4-five_prime_utr-LG07:12237860..12238017 | five_prime_UTR |
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0007649.4-exon | Sed0007649.4-exon-LG07:12237423..12237546 | exon |
Sed0007649.4-exon | Sed0007649.4-exon-LG07:12237667..12237806 | exon |
Sed0007649.4-exon | Sed0007649.4-exon-LG07:12237860..12238264 | exon |
Sed0007649.4-exon | Sed0007649.4-exon-LG07:12239246..12239373 | exon |
Sed0007649.4-exon | Sed0007649.4-exon-LG07:12244571..12245989 | exon |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0007649.4-cds | Sed0007649.4-cds-LG07:12238018..12238264 | CDS |
Sed0007649.4-cds | Sed0007649.4-cds-LG07:12239246..12239373 | CDS |
Sed0007649.4-cds | Sed0007649.4-cds-LG07:12244571..12245035 | CDS |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Sed0007649.4-three_prime_utr | Sed0007649.4-three_prime_utr-LG07:12245036..12245989 | three_prime_UTR |
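The unique names in the feature tables above encode each feature's position as `<name>-<sequence>:<start>..<end>`. The following sketch parses those names and sums the segment lengths, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); the regular expression and the helper `span_length` are illustrative only.

```python
import re

# Pattern for unique names such as "Sed0007649.4-cds-LG07:12238018..12238264".
# Assumption: coordinates are 1-based and inclusive.
UNIQUE_NAME = re.compile(
    r"^(?P<feature>.+)-(?P<seqid>[^:-]+):(?P<start>\d+)\.\.(?P<end>\d+)$"
)


def span_length(unique_name):
    """Return the length in bases of the interval encoded in a unique name."""
    m = UNIQUE_NAME.match(unique_name)
    if m is None:
        raise ValueError(f"unrecognised unique name: {unique_name!r}")
    return int(m["end"]) - int(m["start"]) + 1


cds = [
    "Sed0007649.4-cds-LG07:12238018..12238264",
    "Sed0007649.4-cds-LG07:12239246..12239373",
    "Sed0007649.4-cds-LG07:12244571..12245035",
]
exons = [
    "Sed0007649.4-exon-LG07:12237423..12237546",
    "Sed0007649.4-exon-LG07:12237667..12237806",
    "Sed0007649.4-exon-LG07:12237860..12238264",
    "Sed0007649.4-exon-LG07:12239246..12239373",
    "Sed0007649.4-exon-LG07:12244571..12245989",
]

cds_total = sum(span_length(name) for name in cds)
exon_total = sum(span_length(name) for name in exons)

print(cds_total)           # 840
print(cds_total // 3 - 1)  # 279 codons before the stop codon
print(exon_total)          # 2216
```

Under that assumption the CDS segments total 840 bases, a multiple of three, and the exon total equals the sum of the 5' UTR, CDS, and 3' UTR segments listed in the tables.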
The following polypeptide feature(s) derive from this mRNA:
Feature Name | Unique Name | Type |
Sed0007649.4 | Sed0007649.4-protein | polypeptide |