Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS, three_prime_UTR
ATGCCAAGGCAGAATCTCGCGGCGGAATTGATTCCAGGGAAAATCAGAAAACGAGGCTGTTCCTCATCCGCTTCTTCCTCATCTTCAATTCTTCAAAATTACAGGTTTAAGCGAGCCATTCTGGTGGGTAAAAGGGCTGGATCCTCAACTCCATTGCCTTCATGGAGGCTTATGACCTCACGATCGAGATCCCCTGCTTCAGCATTTTGTTCCACTGAGTCTCCCAATTACGAGCTTCATCAGTGCGCCAGTGGCCGGTCCAAGCAAGCTCCGGTGTCAGCGAGGAAGCTGGCGGCGACGCTGTGGGAGATGAATGAGTTGCCATCGACAAGGGTGAAGGAGGGTCTGGCCTTGGAGGAGAGGAAATCAAGAAAGGAAATGAAGGGCAGAGAGAAAACGACACGGTCTGTTCATTCTGGTTCTTTGCCTCCCCATCTCTCCGATCCGTCGCATAGCCCTGTTTCCGAGGTGGGTTTCTGCTCTTTTCCCTCTTTTCTCAATTGGGTTCTTTACTTTTGTTTGGTTGATCTGATTTTCTGCATTAAATATGTAGAACTCTGTTTGTTGTTTGAGAGCTATTTCTATTTGATTCAATTCTTTGGTCTTGGGATACAAATTAATGGAGGAGATTGACATGTGGGGCTTAGATGAAAGGCTTTTCCTACTCAATATGATCTAACATTGTATTGTTCCTTTCCTGTGATCTGTTGGATTAGATATGGAAATTGTATTTGTATCTATCATGTATGTTCTTCAGGATTCTGGTTTCTACTTTTGTAGAGAGCGGATCGCTCTGGAACAGGCAGTCGCTGTCGAAGAACTCTGTCCATGTCTCAGAGGCTAAAGCTTGCTGATCATGGCGTTGGAGTTCTTGATTCTGTGAGCAATGCTAGCTTGATGGAGGTATGCTGTGTATTAGGTGCTTTATCCGTACATCTTTGTTTGATATCTTGTATGACAATTCCCTAGTGATCCATTAAGCTTATAATCAGATTCGTTTTATACCAATGGCGAAAAGGCGAGTTCATGTATGATAGTTTGATGTTTTTCATATAGATCGAGACAAGATCGAGGATCCAAACTCCGATTGCATCGAATGTCGGTGTTAGGACACGGCTGAAGGATGTTAGTAATGCGTTGACAACTTCTAAGGAGCTTCTCAAAATCATTAACCGTGTTTGGGGACATGAAGATCGTCCTTCCACGAGCATGTCCTTAATCTCGGCCTTACACGCTGAGCTAGAGAGGGCTCGATTGCAGGTCAATCAGCTTGTCCAAGAGCAAAGATATGAGCAGAATGACATAAGCTATCTGATGAGGTGCTTTGCTGAAGAGAAGGAAGCATGGAAGAGCAAGGAGCAAGAAGTTGTGGAGGCAGCGATTGAATCTGTGGCTGGAGAGCTCGAAGTCGAAAGGAAACTTCGAAGAAGGTTCGAGAGCTTGAACAAGAAGCTTGGACGAGAACTTGCCGAGACAAAGTCAACACTTCTGAAAGTAGTCAAAGAACTCGAGAGTGAAAAGAGAGCAAGGGAAATTATGGAGCAAGTATGTGATGACTTAGCAAACGATGTCGGGGACGAAAAGCTAAAAGTCGAGGAAATGCAGAAAGAATCTGCTAAACTTTGTGAGAATGTCAATAAAGAAAGAGAAATGAAGCGAATAGCTGCTGTGCTGCGTGACGAACAAGCTCATATAGATACCGATCTCGACGACAAAAACGCTGCAGTCGATAAACTGAGGAATCAACTCGAAGCGTTCTTGGGTATCAAAAGAGCTAAAGAAAAGGAGTTCGGATCGAAGGACTCGAACGAAGTGAAATTTGCAGCATATCGAAACAAAAACGGGGTTCATTCGTTTCAAAGCGAGGAAAAGGAAGAAGGGGAAGTTGTAGACGGAGTAGAATGTGAAGAAGATTTGGCTGAGAGTGATCTTCACTCAATAGAATTAAACATGGACAACAACAAAAGCTATGACTGGATTCATTCTTCTGGCATTCCTCT
TGATTCAAGGAGGCCTTCGATCGACGATGAACACAAGGCGAGAAAATCGACATCTAAGAAGGGTTCGAGAAAAAGCACATCCTTACAAAGAAGCATTTCCGAGGGAGAGGAATGGGGAGGAAACCAAGCTGAAAACCTTCCAATTTCAGGGGATCATCATCATCATCATCATCATGCTTTAGATTGGGAAAGAAGCTCAGTACTGGAGAAACTAGCTTATGGAGATCAATATCAAGGATACAATTCATCGAAGAACCTCCGAGACCAAATCTTATCCGGGTCAAGGCTCGGTTCAATGAAAGTCACTGCCAGCCCGACACGGGTGTGGGAACAAGCACGACCCTCTAGAGAACTTACAGATTCCCTGGTGCAAGGCAATGGCTTGAAGTCTAGGCTGATGGAGGTTAGAGGCGAGGGTGGGCTAAGTTCAAGGAAGTACAAATAATAACATAACCATGGCAACAACGAGAGCTGCTACAAAGTTGCAGCAGCCCGCTGCTCAAAAGCTCTCAAGGCGGCTGCCAATCCAATCCAATCCAATCCCAGTAGATATATATATTGTGAAATAGAAGACTGCCCATGTTTTGTAAGATTTCTGTAGAAAAAGAAACGCTTTGAATTCTTTTTCGTTCTTATTTTTGCTTGTTTCCCCAGCCCATCTGGAACTGGATCATGTTGAAGATGGTGTGTTCTTTGAGTAATTTTCAACCATTGAAGAGCTTTTTTGTTGTCAGGATTTTGTACAGAAGAATGTTCAAATTTTTATGATCAAACTTTGCT
mRNA sequence
ATGCCAAGGCAGAATCTCGCGGCGGAATTGATTCCAGGGAAAATCAGAAAACGAGGCTGTTCCTCATCCGCTTCTTCCTCATCTTCAATTCTTCAAAATTACAGGTTTAAGCGAGCCATTCTGGTGGGTAAAAGGGCTGGATCCTCAACTCCATTGCCTTCATGGAGGCTTATGACCTCACGATCGAGATCCCCTGCTTCAGCATTTTGTTCCACTGAGTCTCCCAATTACGAGCTTCATCAGTGCGCCAGTGGCCGGTCCAAGCAAGCTCCGGTGTCAGCGAGGAAGCTGGCGGCGACGCTGTGGGAGATGAATGAGTTGCCATCGACAAGGGTGAAGGAGGGTCTGGCCTTGGAGGAGAGGAAATCAAGAAAGGAAATGAAGGGCAGAGAGAAAACGACACGGTCTGTTCATTCTGGTTCTTTGCCTCCCCATCTCTCCGATCCGTCGCATAGCCCTGTTTCCGAGGTGGATATGGAAATTGTATTTGTATCTATCATGTATAGGCTAAAGCTTGCTGATCATGGCGTTGGAGTTCTTGATTCTGTGAGCAATGCTAGCTTGATGGAGATCGAGACAAGATCGAGGATCCAAACTCCGATTGCATCGAATGTCGGTGTTAGGACACGGCTGAAGGATGTTAGTAATGCGTTGACAACTTCTAAGGAGCTTCTCAAAATCATTAACCGTGTTTGGGGACATGAAGATCGTCCTTCCACGAGCATGTCCTTAATCTCGGCCTTACACGCTGAGCTAGAGAGGGCTCGATTGCAGGTCAATCAGCTTGTCCAAGAGCAAAGATATGAGCAGAATGACATAAGCTATCTGATGAGGTGCTTTGCTGAAGAGAAGGAAGCATGGAAGAGCAAGGAGCAAGAAGTTGTGGAGGCAGCGATTGAATCTGTGGCTGGAGAGCTCGAAGTCGAAAGGAAACTTCGAAGAAGGTTCGAGAGCTTGAACAAGAAGCTTGGACGAGAACTTGCCGAGACAAAGTCAACACTTCTGAAAGTAGTCAAAGAACTCGAGAGTGAAAAGAGAGCAAGGGAAATTATGGAGCAAGTATGTGATGACTTAGCAAACGATGTCGGGGACGAAAAGCTAAAAGTCGAGGAAATGCAGAAAGAATCTGCTAAACTTTGTGAGAATGTCAATAAAGAAAGAGAAATGAAGCGAATAGCTGCTGTGCTGCGTGACGAACAAGCTCATATAGATACCGATCTCGACGACAAAAACGCTGCAGTCGATAAACTGAGGAATCAACTCGAAGCGTTCTTGGGTATCAAAAGAGCTAAAGAAAAGGAGTTCGGATCGAAGGACTCGAACGAAGTGAAATTTGCAGCATATCGAAACAAAAACGGGGTTCATTCGTTTCAAAGCGAGGAAAAGGAAGAAGGGGAAGTTGTAGACGGAGTAGAATGTGAAGAAGATTTGGCTGAGAGTGATCTTCACTCAATAGAATTAAACATGGACAACAACAAAAGCTATGACTGGATTCATTCTTCTGGCATTCCTCTTGATTCAAGGAGGCCTTCGATCGACGATGAACACAAGGCGAGAAAATCGACATCTAAGAAGGGTTCGAGAAAAAGCACATCCTTACAAAGAAGCATTTCCGAGGGAGAGGAATGGGGAGGAAACCAAGCTGAAAACCTTCCAATTTCAGGGGATCATCATCATCATCATCATCATGCTTTAGATTGGGAAAGAAGCTCAGTACTGGAGAAACTAGCTTATGGAGATCAATATCAAGGATACAATTCATCGAAGAACCTCCGAGACCAAATCTTATCCGGGTCAAGGCTCGGTTCAATGAAAGTCACTGCCAGCCCGACACGGGTGTGGGAACAAGCACGACCCTCTAGAGAACTTACAGATTCCCTGGTGCAAGGCAATGGCTTGAAGTCTAGGCTGATGGAGGTTAGAGGCGAGGGTGGGCTAAGTTCAAGGAAGTACAAATAATAACATAACCATGGCAACAACGAGAGCTGCTACAAAGTTGC
AGCAGCCCGCTGCTCAAAAGCTCTCAAGGCGGCTGCCAATCCAATCCAATCCAATCCCAGTAGATATATATATTGTGAAATAGAAGACTGCCCATGTTTTGTAAGATTTCTGTAGAAAAAGAAACGCTTTGAATTCTTTTTCGTTCTTATTTTTGCTTGTTTCCCCAGCCCATCTGGAACTGGATCATGTTGAAGATGGTGTGTTCTTTGAGTAATTTTCAACCATTGAAGAGCTTTTTTGTTGTCAGGATTTTGTACAGAAGAATGTTCAAATTTTTATGATCAAACTTTGCT
Coding sequence (CDS)
ATGCCAAGGCAGAATCTCGCGGCGGAATTGATTCCAGGGAAAATCAGAAAACGAGGCTGTTCCTCATCCGCTTCTTCCTCATCTTCAATTCTTCAAAATTACAGGTTTAAGCGAGCCATTCTGGTGGGTAAAAGGGCTGGATCCTCAACTCCATTGCCTTCATGGAGGCTTATGACCTCACGATCGAGATCCCCTGCTTCAGCATTTTGTTCCACTGAGTCTCCCAATTACGAGCTTCATCAGTGCGCCAGTGGCCGGTCCAAGCAAGCTCCGGTGTCAGCGAGGAAGCTGGCGGCGACGCTGTGGGAGATGAATGAGTTGCCATCGACAAGGGTGAAGGAGGGTCTGGCCTTGGAGGAGAGGAAATCAAGAAAGGAAATGAAGGGCAGAGAGAAAACGACACGGTCTGTTCATTCTGGTTCTTTGCCTCCCCATCTCTCCGATCCGTCGCATAGCCCTGTTTCCGAGGTGGATATGGAAATTGTATTTGTATCTATCATGTATAGGCTAAAGCTTGCTGATCATGGCGTTGGAGTTCTTGATTCTGTGAGCAATGCTAGCTTGATGGAGATCGAGACAAGATCGAGGATCCAAACTCCGATTGCATCGAATGTCGGTGTTAGGACACGGCTGAAGGATGTTAGTAATGCGTTGACAACTTCTAAGGAGCTTCTCAAAATCATTAACCGTGTTTGGGGACATGAAGATCGTCCTTCCACGAGCATGTCCTTAATCTCGGCCTTACACGCTGAGCTAGAGAGGGCTCGATTGCAGGTCAATCAGCTTGTCCAAGAGCAAAGATATGAGCAGAATGACATAAGCTATCTGATGAGGTGCTTTGCTGAAGAGAAGGAAGCATGGAAGAGCAAGGAGCAAGAAGTTGTGGAGGCAGCGATTGAATCTGTGGCTGGAGAGCTCGAAGTCGAAAGGAAACTTCGAAGAAGGTTCGAGAGCTTGAACAAGAAGCTTGGACGAGAACTTGCCGAGACAAAGTCAACACTTCTGAAAGTAGTCAAAGAACTCGAGAGTGAAAAGAGAGCAAGGGAAATTATGGAGCAAGTATGTGATGACTTAGCAAACGATGTCGGGGACGAAAAGCTAAAAGTCGAGGAAATGCAGAAAGAATCTGCTAAACTTTGTGAGAATGTCAATAAAGAAAGAGAAATGAAGCGAATAGCTGCTGTGCTGCGTGACGAACAAGCTCATATAGATACCGATCTCGACGACAAAAACGCTGCAGTCGATAAACTGAGGAATCAACTCGAAGCGTTCTTGGGTATCAAAAGAGCTAAAGAAAAGGAGTTCGGATCGAAGGACTCGAACGAAGTGAAATTTGCAGCATATCGAAACAAAAACGGGGTTCATTCGTTTCAAAGCGAGGAAAAGGAAGAAGGGGAAGTTGTAGACGGAGTAGAATGTGAAGAAGATTTGGCTGAGAGTGATCTTCACTCAATAGAATTAAACATGGACAACAACAAAAGCTATGACTGGATTCATTCTTCTGGCATTCCTCTTGATTCAAGGAGGCCTTCGATCGACGATGAACACAAGGCGAGAAAATCGACATCTAAGAAGGGTTCGAGAAAAAGCACATCCTTACAAAGAAGCATTTCCGAGGGAGAGGAATGGGGAGGAAACCAAGCTGAAAACCTTCCAATTTCAGGGGATCATCATCATCATCATCATCATGCTTTAGATTGGGAAAGAAGCTCAGTACTGGAGAAACTAGCTTATGGAGATCAATATCAAGGATACAATTCATCGAAGAACCTCCGAGACCAAATCTTATCCGGGTCAAGGCTCGGTTCAATGAAAGTCACTGCCAGCCCGACACGGGTGTGGGAACAAGCACGACCCTCTAGAGAACTTACAGATTCCCTGGTGCAAGGCAATGGCTTGAAGTCTAGGCTGATGGAGGTTAGAGGCGAGGGTGGGCTAAGTTCAAGGAAGTACAAATAA
Protein sequence
MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTSRSRSPASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEERKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSEVDMEIVFVSIMYRLKLADHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWGHEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQEVVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQVCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDTDLDDKNAAVDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVECEEDLAESDLHSIELNMDNNKSYDWIHSSGIPLDSRRPSIDDEHKARKSTSKKGSRKSTSLQRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLAYGDQYQGYNSSKNLRDQILSGSRLGSMKVTASPTRVWEQARPSRELTDSLVQGNGLKSRLMEVRGEGGLSSRKYK
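The relationship between the coding sequence and the protein sequence listed above can be spot-checked with a short script. The following is a minimal sketch that assumes the standard genetic code (the source does not state which translation table was used); the names `BASES`, `AMINO_ACIDS`, `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
# Standard genetic code, codons enumerated in T, C, A, G order for each position.
# Assumption: the standard table applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCAAGGCAGAATCTC"))  # first six codons of the CDS above
print(translate("AAGTACAAATAA"))        # last four codons, ending in the TAA stop
```

The two fragments translate to MPRQNL and KYK, which agree with the first and last residues of the protein sequence above. Passing the full CDS string to `translate` would check the whole record the same way.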
Homology
BLAST of CmoCh06G004980 vs. ExPASy Swiss-Prot
Match:
Q66GQ2 (Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 PE=2 SV=2)
HSP 1 Score: 114.0 bits (284), Expect = 6.4e-24
Identity = 136/499 (27.25%), Positives = 246/499 (49.30%), Query Frame = 0
Query: 92 VSARKLAATLWEMNEL---------------PSTRVKEG-LALEERKSRKEMKGREKTTR 151
VS+RKLAA WE ++ S ++ G SR++ G+ +
Sbjct: 46 VSSRKLAAAFWEFHQYHYKDEEDCSYSYLSSASAKMHRGPNGFAGASSRRQRHGKAVAVK 105
Query: 152 SVHSGSLPPHLSDPS--HSPVSEVDMEIVFVSIMYR----LKLADHGVGVLDSVSNASLM 211
+ L L DPS H P S + ++ + + +H + + S S +
Sbjct: 106 E-NGLDLSQFLRDPSPDHQPDSAGSLRRQIGQMLIKHHQSIDRNNHALQPVSPASYGSSL 165
Query: 212 EIETRSRIQTPIASNVGVRTR-LKDVSNALTTSKELLKIINRVWGHEDRPSTSMSLISAL 271
E+ T ++ TP +S++ R R ++ L TS ELLK++NR+W E++ +++SLI AL
Sbjct: 166 EVTTYNKAVTP-SSSLEFRGRPSREPHYNLKTSTELLKVLNRIWSLEEQHVSNISLIKAL 225
Query: 272 HAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQEVVEAAIESVAGELEV 331
E+ +R+++ +L++ Q+ +++++ +++ AEEK K+KE E + +A++SV LE
Sbjct: 226 KTEVAHSRVRIKELLRYQQADRHELDSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALED 285
Query: 332 ERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQVCDDLANDVGDEKLK 391
ERKLR+R ESL++K+ REL+E KS+L VKELE ++ ++ME +CD+ A + + +
Sbjct: 286 ERKLRKRSESLHRKMARELSEVKSSLSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEE 345
Query: 392 VEEMQKES--AKLCENVNKEREMKRIAAVLRDEQAHIDTD----LDDKNAAV-DKLRNQL 451
+ ++K++ ++ + IA DE+ + + L+ KN +V DKL ++
Sbjct: 346 IHGLKKKNLDKDWAGRGGGDQLVLHIAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEI 405
Query: 452 EAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVECEEDLAESD 511
E FL + K E N ++ + ++ + ++ V+CEED SD
Sbjct: 406 ETFL---QEKRNEIPRNRRNSLESVPF------NTLSAPPRD-------VDCEEDSGGSD 465
Query: 512 LHSIELNM------DNNKSYDWIHSSG-IPLDSRRPS---IDDEHKARKSTSKKGSRKST 551
+ EL D K + + G I + PS ++ E + + S G +K+T
Sbjct: 466 SNCFELKKPAESYGDETKKPNQHNKDGSIDEKPKSPSSFQVNFEDQMAWALSSNGKKKTT 523
BLAST of CmoCh06G004980 vs. ExPASy TrEMBL
Match:
A0A6J1F862 (uncharacterized protein At5g41620 OS=Cucurbita moschata OX=3662 GN=LOC111443020 PE=4 SV=1)
HSP 1 Score: 1199.1 bits (3101), Expect = 0.0e+00
Identity = 642/659 (97.42%), Positives = 644/659 (97.72%), Query Frame = 0
Query: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS 60
MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS
Sbjct: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS 60
Query: 61 RSRSPASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE 120
RSRSPASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE
Sbjct: 61 RSRSPASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE 120
Query: 121 RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSE-VDMEIV------FVSIMYRLKLA 180
RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSE D +S+ RLKLA
Sbjct: 121 RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSERADRSGTGSRCRRTLSMSQRLKLA 180
Query: 181 DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG 240
DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG
Sbjct: 181 DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG 240
Query: 241 HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE 300
HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE
Sbjct: 241 HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE 300
Query: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ 360
VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ
Sbjct: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ 360
Query: 361 VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDTDLDDKNAA 420
VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDTDLDDKNAA
Sbjct: 361 VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDTDLDDKNAA 420
Query: 421 VDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVEC 480
VDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVEC
Sbjct: 421 VDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVEC 480
Query: 481 EEDLAESDLHSIELNMDNNKSYDWIHSSGIPLDSRRPSIDDEHKARKSTSKKGSRKSTSL 540
EEDLAESDLHSIELNMDNNKSYDWIHSSGIPLDSRRPSIDDEHKARKSTSKKGSRKSTSL
Sbjct: 481 EEDLAESDLHSIELNMDNNKSYDWIHSSGIPLDSRRPSIDDEHKARKSTSKKGSRKSTSL 540
Query: 541 QRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLAYGDQYQGYNSSKNLRD 600
QRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLAYGDQYQGYNSSKNLRD
Sbjct: 541 QRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLAYGDQYQGYNSSKNLRD 600
Query: 601 QILSGSRLGSMKVTASPTRVWEQARPSRELTDSLVQGNGLKSRLMEVRGEGGLSSRKYK 653
QILSGSRLGSMKVTASPTRVWEQARPSRELTDSLVQGNGLKSRLMEVRGEGGLSSRKYK
Sbjct: 601 QILSGSRLGSMKVTASPTRVWEQARPSRELTDSLVQGNGLKSRLMEVRGEGGLSSRKYK 659
BLAST of CmoCh06G004980 vs. ExPASy TrEMBL
Match:
A0A6J1KU72 (uncharacterized protein At5g41620 OS=Cucurbita maxima OX=3661 GN=LOC111498279 PE=4 SV=1)
HSP 1 Score: 1172.5 bits (3032), Expect = 0.0e+00
Identity = 633/659 (96.05%), Positives = 637/659 (96.66%), Query Frame = 0
Query: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS 60
MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS
Sbjct: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS 60
Query: 61 RSRSPASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE 120
RSRSPASAF STESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE
Sbjct: 61 RSRSPASAFRSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE 120
Query: 121 RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSE-VDMEIV------FVSIMYRLKLA 180
RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSE D S+ R KLA
Sbjct: 121 RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSERADRSGTGSRCRRTPSMSQRPKLA 180
Query: 181 DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG 240
DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG
Sbjct: 181 DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG 240
Query: 241 HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE 300
HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE
Sbjct: 241 HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE 300
Query: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ 360
VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ
Sbjct: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ 360
Query: 361 VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDTDLDDKNAA 420
VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDTDLDDKNAA
Sbjct: 361 VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDTDLDDKNAA 420
Query: 421 VDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVEC 480
VDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVEC
Sbjct: 421 VDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVEC 480
Query: 481 EEDLAESDLHSIELNMDNNKSYDWIHSSGIPLDSRRPSIDDEHKARKSTSKKGSRKSTSL 540
EEDLAESDLHSIELNMDNNKSYDWIHSSGIPLDSRRPSIDDE K+RKSTSKKGSRKSTSL
Sbjct: 481 EEDLAESDLHSIELNMDNNKSYDWIHSSGIPLDSRRPSIDDELKSRKSTSKKGSRKSTSL 540
Query: 541 QRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLAYGDQYQGYNSSKNLRD 600
QRSISEGEEWGGNQAENLPISGD HHHHHH LDWERSSVLEKLAYGDQYQGYNSSKNLRD
Sbjct: 541 QRSISEGEEWGGNQAENLPISGD-HHHHHHVLDWERSSVLEKLAYGDQYQGYNSSKNLRD 600
Query: 601 QILSGSRLGSMKVTASPTRVWEQARPSRELTDSLVQGNGLKSRLMEVRGEGGLSSRKYK 653
QILSGSRLGSMKVTASPTR+WEQARPSREL DS+VQGNGLKSRLMEVRGEGGLSSRKYK
Sbjct: 601 QILSGSRLGSMKVTASPTRLWEQARPSRELADSMVQGNGLKSRLMEVRGEGGLSSRKYK 658
BLAST of CmoCh06G004980 vs. ExPASy TrEMBL
Match:
A0A5D3DNS4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004220 PE=4 SV=1)
HSP 1 Score: 1023.5 bits (2645), Expect = 4.0e-295
Identity = 572/676 (84.62%), Positives = 605/676 (89.50%), Query Frame = 0
Query: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS 60
MPRQNLAAELIPGKIRKRGCSSSASSSSSIL NYRFKRAILVGKRAGSSTPLPSWRLM+S
Sbjct: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILHNYRFKRAILVGKRAGSSTPLPSWRLMSS 60
Query: 61 RSRSPASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE 120
RSRSPASAF STESPNYEL+QC SGRSKQAPVSARKLAATLWEMNELPSTRVKE LAL+E
Sbjct: 61 RSRSPASAFRSTESPNYELYQCGSGRSKQAPVSARKLAATLWEMNELPSTRVKESLALDE 120
Query: 121 RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSEVDMEI-------VFVSIMYRLKLA 180
RKSRKEMK REKTTRSVHSGSLPPHLSDPSHSPVSE S+ RLKLA
Sbjct: 121 RKSRKEMKAREKTTRSVHSGSLPPHLSDPSHSPVSERGDRSGTGSRCRRTPSMSQRLKLA 180
Query: 181 DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG 240
DHGVGVLDSVSNASLMEIETRSR TP AS VGV+TRLKDVS+ALTTSKELLKIINRVWG
Sbjct: 181 DHGVGVLDSVSNASLMEIETRSRAPTPSASIVGVKTRLKDVSSALTTSKELLKIINRVWG 240
Query: 241 HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE 300
HEDRPSTSMSLISALHAELERARLQ+NQL+QEQRYEQ+DISYLMRCFAEEKEAWK+KEQE
Sbjct: 241 HEDRPSTSMSLISALHAELERARLQINQLIQEQRYEQSDISYLMRCFAEEKEAWKNKEQE 300
Query: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ 360
VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKS+LLKVVKELESEKRAREIMEQ
Sbjct: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSSLLKVVKELESEKRAREIMEQ 360
Query: 361 VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDT----DLDD 420
VCDDLANDVGD+KL++ EMQ+ESAKLC+NV KEREMKR+AA L +EQ HID DL+D
Sbjct: 361 VCDDLANDVGDDKLELGEMQRESAKLCDNVKKEREMKRLAAALHEEQTHIDASDKYDLED 420
Query: 421 KNAAVDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVD 480
KNAAVDKLRNQLE+FLGIKRAKEKEFGS DSNEVKFAAY NKNG+ SFQ EEKEEGEVVD
Sbjct: 421 KNAAVDKLRNQLESFLGIKRAKEKEFGSNDSNEVKFAAYLNKNGIRSFQCEEKEEGEVVD 480
Query: 481 GVECEEDLAESDLHSIELNMD-NNKSYDWIHSSGIPLDSRRPSIDDEHKARKSTSKKGSR 540
GVECEEDLAESDLHSIELN+D NNKSYDWIHSSGIPLD+RRPSIDDE KARKSTSKKGSR
Sbjct: 481 GVECEEDLAESDLHSIELNVDNNNKSYDWIHSSGIPLDTRRPSIDDEPKARKSTSKKGSR 540
Query: 541 KSTSLQRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLA----YGDQYQG 600
KSTS+QRSIS+G EW GNQA+N PI GD H LDWERSSVLEK+A YGD + G
Sbjct: 541 KSTSIQRSISDGVEW-GNQADNHPILGD------HVLDWERSSVLEKVASGKVYGDHFPG 600
Query: 601 YN-SSKNLRDQILSGSRLGSMKVTASPTRVWEQARPSRELTD------SLVQG-NGLKSR 653
YN SSKNLRDQILSGSRLGS+KVTASPTR+WEQARP R+L D S+VQG NGLKSR
Sbjct: 601 YNSSSKNLRDQILSGSRLGSLKVTASPTRLWEQARPLRDLADPVTERASMVQGSNGLKSR 660
BLAST of CmoCh06G004980 vs. ExPASy TrEMBL
Match:
A0A1S3B697 (uncharacterized protein At5g41620 OS=Cucumis melo OX=3656 GN=LOC103486645 PE=4 SV=1)
HSP 1 Score: 1023.5 bits (2645), Expect = 4.0e-295
Identity = 572/676 (84.62%), Positives = 605/676 (89.50%), Query Frame = 0
Query: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS 60
MPRQNLAAELIPGKIRKRGCSSSASSSSSIL NYRFKRAILVGKRAGSSTPLPSWRLM+S
Sbjct: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILHNYRFKRAILVGKRAGSSTPLPSWRLMSS 60
Query: 61 RSRSPASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE 120
RSRSPASAF STESPNYEL+QC SGRSKQAPVSARKLAATLWEMNELPSTRVKE LAL+E
Sbjct: 61 RSRSPASAFRSTESPNYELYQCGSGRSKQAPVSARKLAATLWEMNELPSTRVKESLALDE 120
Query: 121 RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSEVDMEI-------VFVSIMYRLKLA 180
RKSRKEMK REKTTRSVHSGSLPPHLSDPSHSPVSE S+ RLKLA
Sbjct: 121 RKSRKEMKAREKTTRSVHSGSLPPHLSDPSHSPVSERGDRSGTGSRCRRTPSMSQRLKLA 180
Query: 181 DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG 240
DHGVGVLDSVSNASLMEIETRSR TP AS VGV+TRLKDVS+ALTTSKELLKIINRVWG
Sbjct: 181 DHGVGVLDSVSNASLMEIETRSRAPTPSASIVGVKTRLKDVSSALTTSKELLKIINRVWG 240
Query: 241 HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE 300
HEDRPSTSMSLISALHAELERARLQ+NQL+QEQRYEQ+DISYLMRCFAEEKEAWK+KEQE
Sbjct: 241 HEDRPSTSMSLISALHAELERARLQINQLIQEQRYEQSDISYLMRCFAEEKEAWKNKEQE 300
Query: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ 360
VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKS+LLKVVKELESEKRAREIMEQ
Sbjct: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSSLLKVVKELESEKRAREIMEQ 360
Query: 361 VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDT----DLDD 420
VCDDLANDVGD+KL++ EMQ+ESAKLC+NV KEREMKR+AA L +EQ HID DL+D
Sbjct: 361 VCDDLANDVGDDKLELGEMQRESAKLCDNVKKEREMKRLAAALHEEQTHIDASDKYDLED 420
Query: 421 KNAAVDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVD 480
KNAAVDKLRNQLE+FLGIKRAKEKEFGS DSNEVKFAAY NKNG+ SFQ EEKEEGEVVD
Sbjct: 421 KNAAVDKLRNQLESFLGIKRAKEKEFGSNDSNEVKFAAYLNKNGIRSFQCEEKEEGEVVD 480
Query: 481 GVECEEDLAESDLHSIELNMD-NNKSYDWIHSSGIPLDSRRPSIDDEHKARKSTSKKGSR 540
GVECEEDLAESDLHSIELN+D NNKSYDWIHSSGIPLD+RRPSIDDE KARKSTSKKGSR
Sbjct: 481 GVECEEDLAESDLHSIELNVDNNNKSYDWIHSSGIPLDTRRPSIDDEPKARKSTSKKGSR 540
Query: 541 KSTSLQRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLA----YGDQYQG 600
KSTS+QRSIS+G EW GNQA+N PI GD H LDWERSSVLEK+A YGD + G
Sbjct: 541 KSTSIQRSISDGVEW-GNQADNHPILGD------HVLDWERSSVLEKVASGKVYGDHFPG 600
Query: 601 YN-SSKNLRDQILSGSRLGSMKVTASPTRVWEQARPSRELTD------SLVQG-NGLKSR 653
YN SSKNLRDQILSGSRLGS+KVTASPTR+WEQARP R+L D S+VQG NGLKSR
Sbjct: 601 YNSSSKNLRDQILSGSRLGSLKVTASPTRLWEQARPLRDLADPVTERASMVQGSNGLKSR 660
BLAST of CmoCh06G004980 vs. ExPASy TrEMBL
Match:
A0A0A0LAZ1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G778220 PE=4 SV=1)
HSP 1 Score: 1019.2 bits (2634), Expect = 7.5e-294
Identity = 569/676 (84.17%), Positives = 604/676 (89.35%), Query Frame = 0
Query: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILQNYRFKRAILVGKRAGSSTPLPSWRLMTS 60
MPRQNLAAELIPGKIRKRGCSSSASSSSSIL NYRFKRAILVGKRAGSSTPLPSWRLM+S
Sbjct: 1 MPRQNLAAELIPGKIRKRGCSSSASSSSSILHNYRFKRAILVGKRAGSSTPLPSWRLMSS 60
Query: 61 RSRSPASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEE 120
RSRSPASAF STESPNYEL+QC SGRSKQAPVSARKLAATLWEMNELPSTRVKE LAL+E
Sbjct: 61 RSRSPASAFRSTESPNYELYQCGSGRSKQAPVSARKLAATLWEMNELPSTRVKESLALDE 120
Query: 121 RKSRKEMKGREKTTRSVHSGSLPPHLSDPSHSPVSEVDMEI-------VFVSIMYRLKLA 180
RKSRKEMK REKTTRSVHSGSLPPHLSDPSHSPVSE S+ RLKLA
Sbjct: 121 RKSRKEMKAREKTTRSVHSGSLPPHLSDPSHSPVSERGDRSGTGSRCRRTPSMSQRLKLA 180
Query: 181 DHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG 240
DHGVGVLDSVSNASLMEIE+RSR TP AS VGV+TRLKDVSNALTTSKELLKIINRVWG
Sbjct: 181 DHGVGVLDSVSNASLMEIESRSRAPTPSASIVGVKTRLKDVSNALTTSKELLKIINRVWG 240
Query: 241 HEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQE 300
HEDRPSTSMSLISALHAE+ERARLQ+NQL+QEQRYEQ+DISYLMRCFAEEKEAWKSKEQE
Sbjct: 241 HEDRPSTSMSLISALHAEMERARLQINQLIQEQRYEQSDISYLMRCFAEEKEAWKSKEQE 300
Query: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQ 360
VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKS+LLKVVKELESEKRAREIMEQ
Sbjct: 301 VVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSSLLKVVKELESEKRAREIMEQ 360
Query: 361 VCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDT----DLDD 420
VCDDLANDVGD+KL++ E Q+ESAKLC+NV KEREMKR+AA L +E+ H D DL+D
Sbjct: 361 VCDDLANDVGDDKLELGERQRESAKLCDNVKKEREMKRLAAALHEERTHTDASDKYDLED 420
Query: 421 KNAAVDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVD 480
KN AVDKLRNQLEAFLGIKRAKEKEFGS DSNEVKFAAY +KNG+ SFQSEEKEEGEVVD
Sbjct: 421 KNVAVDKLRNQLEAFLGIKRAKEKEFGSNDSNEVKFAAYLSKNGIRSFQSEEKEEGEVVD 480
Query: 481 GVECEEDLAESDLHSIELNMD-NNKSYDWIHSSGIPLDSRRPSIDDEHKARKSTSKKGSR 540
GVECEEDLAESDLHSIELNMD NNKSYDWIHSSGIP D+RRPS+DDE KARKSTSKKGSR
Sbjct: 481 GVECEEDLAESDLHSIELNMDNNNKSYDWIHSSGIPHDTRRPSVDDELKARKSTSKKGSR 540
Query: 541 KSTSLQRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLA----YGDQYQG 600
KSTS+QRSIS+G EW GNQA+N PISGD H LDW+RSSVLEK+A YGD + G
Sbjct: 541 KSTSIQRSISDGVEW-GNQADNHPISGD------HVLDWDRSSVLEKVASGKVYGDHFLG 600
Query: 601 YN-SSKNLRDQILSGSRLGSMKVTASPTRVWEQARPSRELTD------SLVQG-NGLKSR 653
YN SSKNLRDQILSGSRLGS+KVTASPTR+WEQARPSR+L D S+VQG NGLKSR
Sbjct: 601 YNSSSKNLRDQILSGSRLGSLKVTASPTRLWEQARPSRDLADPVTERASMVQGSNGLKSR 660
BLAST of CmoCh06G004980 vs. TAIR 10
Match:
AT3G11590.1 (unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G22310.1); Has 22320 Blast hits to 15179 proteins in 1213 species: Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi - 1700; Plants - 1146; Viruses - 65; Other Eukaryotes - 5824 (source: NCBI BLink). )
HSP 1 Score: 535.8 bits (1379), Expect = 4.8e-152
Identity = 351/625 (56.16%), Positives = 424/625 (67.84%), Query Frame = 0
Query: 1 MPRQNLAAE--LIPGKIRKRGCSSSASSSSSIL-QNYRFKRAILVGKRAGSSTPLPSWRL 60
MPRQN + E L+ GKIRKRGCSS SS+SSIL + YRFKRAI+VGKR GS+TP+P+WRL
Sbjct: 1 MPRQNQSVENLLLLGKIRKRGCSSPTSSTSSILREGYRFKRAIVVGKRGGSTTPVPTWRL 60
Query: 61 MTSRSRSP--ASAFCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEG 120
M RS SP + A + SP+ S APVSARKLAATLWEMNE+PS RV E
Sbjct: 61 M-GRSPSPRASGALHAAASPSSHCGSKTGKVSAPAPVSARKLAATLWEMNEMPSPRVVEE 120
Query: 121 LALEERKSRKEMKGREKTTR-SVHSGSLPPHLSDPSHSPVSE-------VDMEIVFVSIM 180
A RKSRKE R SVHSGSLPPHLSDPSHSPVSE + S +
Sbjct: 121 AAPMIRKSRKERIAPLPPPRSSVHSGSLPPHLSDPSHSPVSERMERSGTGSRQRRASSTV 180
Query: 181 YRLKLADHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKI 240
+L+L D VG D +++ S M+IETRSR++TP S VGV+TRLKD SNALTTSKELLKI
Sbjct: 181 QKLRLGDCNVGARDPINSGSFMDIETRSRVETPTGSTVGVKTRLKDCSNALTTSKELLKI 240
Query: 241 INRVWGHEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAW 300
INR+WG +DRPS+SMSL+SALH+ELERARLQVNQL+ E + E NDISYLM+ FAEEK W
Sbjct: 241 INRMWGQDDRPSSSMSLVSALHSELERARLQVNQLIHEHKPENNDISYLMKRFAEEKAVW 300
Query: 301 KSKEQEVVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRA 360
KS EQEVVEAAIESVAGELEVERKLRRRFESLNKKLG+ELAETKS L+K VKE+E+EKRA
Sbjct: 301 KSNEQEVVEAAIESVAGELEVERKLRRRFESLNKKLGKELAETKSALMKAVKEIENEKRA 360
Query: 361 REIMEQVCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDT-- 420
R ++E+VCD+LA D+ ++K +VEE+++ES K+ E V KEREM ++A LR+E+ +
Sbjct: 361 RVMVEKVCDELARDISEDKAEVEELKRESFKVKEEVEKEREMLQLADALREERVQMKLSE 420
Query: 421 ---DLDDKNAAVDKLRNQLEAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEK 480
L++KNAAVDKLRNQL+ +L KR KEK + A N SF S
Sbjct: 421 AKHQLEEKNAAVDKLRNQLQTYLKAKRCKEKTREPPQTQLHNEEAGDYLNHHISFGSYNI 480
Query: 481 EEGEVVDGVECEEDLAESDLHSIELNMDNNKSYDWIHSSGIPLDSRRPSIDDEHKARKST 540
E+GEV +G EE ESDLHSIELN+D NKSY W + +E++ RKST
Sbjct: 481 EDGEVENG--NEEGSGESDLHSIELNID-NKSYKWPYG-------------EENRGRKST 540
Query: 541 SKKGSRKSTSLQRSISEGEEWGGNQAENLPISGDHHHHHHHALDWERSSVLEKLAYGDQY 600
RKS SLQRSIS+ +W Q+E L SGD LDW RS +E Y D+
Sbjct: 541 ----PRKSLSLQRSISDCVDW-VVQSEKLQKSGD------GGLDWGRSIDVEPKGYIDET 597
Query: 601 QGY--NSSKNLRDQILSGSRLGSMK 606
Q Y N + + ILSGSRL + +
Sbjct: 601 QAYKPNKASSKDHHILSGSRLSNFR 597
BLAST of CmoCh06G004980 vs. TAIR 10
Match:
AT5G22310.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11590.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 180.6 bits (457), Expect = 3.9e-45
Identity = 168/498 (33.73%), Positives = 250/498 (50.20%), Query Frame = 0
Query: 14 KIRKRGCSSSASSSSSILQNYRFKRAILVGKRA-----GSSTPLPSWRLMTSRSRSPASA 73
KIRKRG S+SSSSS+ + RFKRAI GKRA GS TP+ S + +++P
Sbjct: 9 KIRKRG--GSSSSSSSLARRNRFKRAIFAGKRAAQDDGGSGTPVKS----ITAAKTPVLL 68
Query: 74 FCSTESPNYELHQCASGRSKQAPVSARKLAATLWEMNELPSTRVKEGLALEERKSRKEMK 133
S E+ + HQ +++ VSARKLAATLWE+N+ V + +S+K +
Sbjct: 69 SFSPENLPIDHHQL-----QKSCVSARKLAATLWEINDDADPPVNSD--KDCLRSKKPSR 128
Query: 134 GREKTTRSVHSGSLPPHLSDPSHSPVSEVDMEIVFVSIMYRLKLADHGVGVLDSVSNASL 193
R K + S PP SDP +L+ + + D +
Sbjct: 129 YRAKKSTEFSSIDFPPRSSDPIS-------------------RLSSERIDLCDDMIRRRS 188
Query: 194 MEIETRSRIQTPIASNVGVRTRLKDVSNALTTSKELLKIINRVWG-HEDRPSTSMSLISA 253
+ + I+ I V+TR K+VS+ LTTSKEL+K++ R+ +D + S LISA
Sbjct: 189 TNPQKLNPIEYKIIGANSVKTRFKNVSDGLTTSKELVKVLKRIGELGDDHKTASNRLISA 248
Query: 254 LHAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQEVVEAAIESVAGELE 313
L EL+RAR + L+ E +E+E IES+ E
Sbjct: 249 LLCELDRARSSLKHLMSE----------------------LDEEEEEKRRLIESLQEEAM 308
Query: 314 VERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQVCDDLANDVGDEKL 373
VERKLRRR E +N++LGREL E K T K+ +E++ EKRA++++E+VCD+L +GD+K
Sbjct: 309 VERKLRRRTEKMNRRLGRELTEAKETERKMKEEMKREKRAKDVLEEVCDELTKGIGDDKK 368
Query: 374 KVEEMQKESAKLCENVNKEREMKRIAAVLRDEQAHIDT-----DLDDKNAAVDKLRNQLE 433
++E KEREM IA VLR+E+ + + +DK AAV++L+ +L
Sbjct: 369 EME--------------KEREMMHIADVLREERVQMKLTEAKFEFEDKYAAVERLKKEL- 412
Query: 434 AFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVECEEDLAESDL 493
+R + E G K S+E++ EV+DG ++D ESDL
Sbjct: 429 -----RRVLDGEEG-KGSSEIRRIL------------------EVIDGSGSDDD-EESDL 412
Query: 494 HSIELNMDNNKSYDWIHS 501
SIELNM++ + ++ S
Sbjct: 489 KSIELNMESGSKWGYVDS 412
BLAST of CmoCh06G004980 vs. TAIR 10
Match:
AT1G50660.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast hits to 15134 proteins in 1325 species: Archae - 461; Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants - 1035; Viruses - 42; Other Eukaryotes - 4809 (source: NCBI BLink). )
HSP 1 Score: 118.6 bits (296), Expect = 1.8e-26
Identity = 146/551 (26.50%), Positives = 264/551 (47.91%), Query Frame = 0
Query: 43 GKRAGSSTPLPSWRLM-TSRSRSPASAFCSTESPNYELHQCASGRSKQ-----APVSARK 102
G+R+ TPL W++ ++ RS E N+++ + + R K PVS RK
Sbjct: 59 GRRSRPETPLLKWKVEDRNKERSGVVEDDDYEDDNHQVARSETTRRKDRRKIARPVSVRK 118
Query: 103 LAATLWEMNELPSTRVKEGLALEERKSRKEMKGREKTTRSVHSGSL-PPHLSDPSHSPVS 162
LAA LW + ++P G E KG+E + G + P+L S P
Sbjct: 119 LAAGLWRL-QVPDASSSGG----------ERKGKEGLGFQGNGGYMGVPYLYHHSDKPSG 178
Query: 163 EVDMEIVFVSIMYRLKLADHGVGVLDSVSNASLMEIETRSRIQTPIASNVGVRTRLKDVS 222
+I + + N L ++E + P ++ G T+ V
Sbjct: 179 GQSNKI------------RQNPSTIATTKNGFLCKLE--PSMPFPHSAMEGA-TKWDPV- 238
Query: 223 NALTTSKELLKIINRVWGHEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISY 282
L T +E+ +I + + D+ ++SL+S+L AELE A ++ L E+R + +
Sbjct: 239 -CLDTMEEVHQIYSNM-KRIDQQVNAVSLVSSLEAELEEAHARIEDLESEKRSHKKKLEQ 298
Query: 283 LMRCFAEEKEAWKSKEQEVVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLL 342
+R +EE+ AW+S+E E V A I+ + ++ E+K R+R E +N KL ELA++K +
Sbjct: 299 FLRKVSEERAAWRSREHEKVRAIIDDMKTDMNREKKTRQRLEIVNHKLVNELADSKLAVK 358
Query: 343 KVVKELESEKRAREIMEQVCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAV 402
+ +++ E E++ARE++E+VCD+LA ++G++K ++E +++ES L E V+ ER M ++A V
Sbjct: 359 RYMQDYEKERKARELIEEVCDELAKEIGEDKAEIEALKRESMSLREEVDDERRMLQMAEV 418
Query: 403 LRDEQAHI-----DTDLDDKNAAVDKLRNQLEAFL-------GIKRAKEKEF-----GSK 462
R+E+ + L+++ + ++KL LE+FL +K +E E S
Sbjct: 419 WREERVQMKLIDAKVALEERYSQMNKLVGDLESFLRSRDIVTDVKEVREAELLRETAASV 478
Query: 463 DSNEVKFAAY--RNKNGVHSFQSEEKEEGEVVDGVECEEDLAESDLHSIELNMDNNKSYD 522
+ E+K Y N + +++ EE GE D E E+ +A S + S+D
Sbjct: 479 NIQEIKEFTYVPANPDDIYAV-FEEMNLGEAHDR-EMEKSVAYSPI-----------SHD 538
Query: 523 -WIHSSGIPLDSRRPSIDDEHKARKSTSKKGSRKSTSLQRSISEGEEWGGNQAE--NLPI 565
+H+ + LD+ + H + + S ++S EE G + + ++P
Sbjct: 539 SKVHT--VSLDANMMNKKGRHSDAYTHQNGDIEEDDSGWETVSHLEEQGSSYSPDGSIPS 565
BLAST of CmoCh06G004980 vs. TAIR 10
Match:
AT5G41620.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, plasma membrane; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: 6 growth stages; BEST Arabidopsis thaliana protein match is: intracellular protein transport protein USO1-related (TAIR:AT1G64180.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 114.0 bits (284), Expect = 4.5e-25
Identity = 136/499 (27.25%), Positives = 246/499 (49.30%), Query Frame = 0
Query: 92 VSARKLAATLWEMNEL---------------PSTRVKEG-LALEERKSRKEMKGREKTTR 151
VS+RKLAA WE ++ S ++ G SR++ G+ +
Sbjct: 46 VSSRKLAAAFWEFHQYHYKDEEDCSYSYLSSASAKMHRGPNGFAGASSRRQRHGKAVAVK 105
Query: 152 SVHSGSLPPHLSDPS--HSPVSEVDMEIVFVSIMYR----LKLADHGVGVLDSVSNASLM 211
+ L L DPS H P S + ++ + + +H + + S S +
Sbjct: 106 E-NGLDLSQFLRDPSPDHQPDSAGSLRRQIGQMLIKHHQSIDRNNHALQPVSPASYGSSL 165
Query: 212 EIETRSRIQTPIASNVGVRTR-LKDVSNALTTSKELLKIINRVWGHEDRPSTSMSLISAL 271
E+ T ++ TP +S++ R R ++ L TS ELLK++NR+W E++ +++SLI AL
Sbjct: 166 EVTTYNKAVTP-SSSLEFRGRPSREPHYNLKTSTELLKVLNRIWSLEEQHVSNISLIKAL 225
Query: 272 HAELERARLQVNQLVQEQRYEQNDISYLMRCFAEEKEAWKSKEQEVVEAAIESVAGELEV 331
E+ +R+++ +L++ Q+ +++++ +++ AEEK K+KE E + +A++SV LE
Sbjct: 226 KTEVAHSRVRIKELLRYQQADRHELDSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALED 285
Query: 332 ERKLRRRFESLNKKLGRELAETKSTLLKVVKELESEKRAREIMEQVCDDLANDVGDEKLK 391
ERKLR+R ESL++K+ REL+E KS+L VKELE ++ ++ME +CD+ A + + +
Sbjct: 286 ERKLRKRSESLHRKMARELSEVKSSLSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEE 345
Query: 392 VEEMQKES--AKLCENVNKEREMKRIAAVLRDEQAHIDTD----LDDKNAAV-DKLRNQL 451
+ ++K++ ++ + IA DE+ + + L+ KN +V DKL ++
Sbjct: 346 IHGLKKKNLDKDWAGRGGGDQLVLHIAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEI 405
Query: 452 EAFLGIKRAKEKEFGSKDSNEVKFAAYRNKNGVHSFQSEEKEEGEVVDGVECEEDLAESD 511
E FL + K E N ++ + ++ + ++ V+CEED SD
Sbjct: 406 ETFL---QEKRNEIPRNRRNSLESVPF------NTLSAPPRD-------VDCEEDSGGSD 465
Query: 512 LHSIELNM------DNNKSYDWIHSSG-IPLDSRRPS---IDDEHKARKSTSKKGSRKST 551
+ EL D K + + G I + PS ++ E + + S G +K+T
Sbjct: 466 SNCFELKKPAESYGDETKKPNQHNKDGSIDEKPKSPSSFQVNFEDQMAWALSSNGKKKTT 523
BLAST of CmoCh06G004980 vs. TAIR 10
Match:
AT3G20350.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G50660.1); Has 15095 Blast hits to 11224 proteins in 1051 species: Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi - 1255; Plants - 746; Viruses - 40; Other Eukaryotes - 4245 (source: NCBI BLink). )
HSP 1 Score: 99.0 bits (245), Expect = 1.5e-20
Identity = 98/370 (26.49%), Positives = 190/370 (51.35%), Query Frame = 0
Query: 218 LTTSKELLKIINRV-WGHEDRPSTSMSLISALHAELERARLQVNQLVQEQRYEQNDISYL 277
L T ++ +I V W ++ +SL S++ +L+ AR + L E+R ++ +
Sbjct: 190 LDTRDDVHQIYTNVKWNNQQ--VNDVSLASSIELKLQEARACIKDLESEKRSQKKKLEQF 249
Query: 278 MRCFAEEKEAWKSKEQEVVEAAIESVAGELEVERKLRRRFESLNKKLGRELAETKSTLLK 337
++ +EE+ AW+S+E E V A I+ + ++ E+K R+R E +N KL ELA++K + +
Sbjct: 250 LKKVSEERAAWRSREHEKVRAIIDDMKADMNQEKKTRQRLEIVNSKLVNELADSKLAVKR 309
Query: 338 VVKELESEKRAREIMEQVCDDLANDVGDEKLKVEEMQKESAKLCENVNKEREMKRIAAVL 397
+ + + E++ARE++E+VCD+LA ++ ++K ++E ++ ES L E V+ ER M ++A V
Sbjct: 310 YMHDYQQERKARELIEEVCDELAKEIEEDKAEIEALKSESMNLREEVDDERRMLQMAEVW 369
Query: 398 RDEQAHI-----DTDLDDKNAAVDKLRNQLEAFLGIKRAK-EKEFGSKDSNEVKFAAYRN 457
R+E+ + L++K + ++KL +EAFL + KE + A+ N
Sbjct: 370 REERVQMKLIDAKVTLEEKYSQMNKLVGDMEAFLSSRNTTGVKEVRVAELLRETAASVDN 429
Query: 458 KNGVHSFQSEEKEEGEVVDGVECEEDLAESDLHSIELNMDNN---KSYDWIHSSGIPLDS 517
+ F E + +++ E ++NM N +S ++ S + S
Sbjct: 430 IQEIKEFTYEPAKPDDILMLFE-------------QMNMGENQDRESEQYVAYSPVSHAS 489
Query: 518 R----RPSIDDEHKARKS---TSKKGS-RKSTSLQRSISEGEEWGGNQA--ENLP-ISGD 567
+ P ++ +K R S T + G + S ++S EE G + + E++P IS
Sbjct: 490 KAHTVSPDVNLINKGRHSNAFTDQNGEFEEDDSGWETVSHSEEHGSSYSPDESIPNISNT 544
The following BLAST results are available for this feature:
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q66GQ2 | 6.4e-24 | 27.25 | Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 P...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1F862 | 0.0e+00 | 97.42 | uncharacterized protein At5g41620 OS=Cucurbita moschata OX=3662 GN=LOC111443020 ...
A0A6J1KU72 | 0.0e+00 | 96.05 | uncharacterized protein At5g41620 OS=Cucurbita maxima OX=3661 GN=LOC111498279 PE...
A0A5D3DNS4 | 4.0e-295 | 84.62 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B697 | 4.0e-295 | 84.62 | uncharacterized protein At5g41620 OS=Cucumis melo OX=3656 GN=LOC103486645 PE=4 S...
A0A0A0LAZ1 | 7.5e-294 | 84.17 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G778220 PE=4 SV=1
TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G11590.1 | 4.8e-152 | 56.16 | unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures;...
AT5G22310.1 | 3.9e-45 | 33.73 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G50660.1 | 1.8e-26 | 26.50 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas...
AT5G41620.1 | 4.5e-25 | 27.25 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT3G20350.1 | 1.5e-20 | 26.49 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma mem...
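The identity percentages in the summary are the matched-residue counts from each HSP header divided by the alignment length. As a rough sketch of that arithmetic, the snippet below parses an HSP summary line and recomputes the percentages; the function name `check_hsp_line` and the rounding tolerance are assumptions made for illustration and are not part of BLAST itself.

```python
import re

# Matches fragments such as "Identity = 136/499 (27.25%)" regardless of the label used.
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_line(line):
    """Return {label: (recomputed_percent, agrees_with_reported)} for one HSP summary line."""
    results = {}
    for label, matched, length, reported in FRACTION.findall(line):
        computed = round(100 * int(matched) / int(length), 2)
        # Small tolerance for rounding differences (an assumption of this sketch).
        results[label] = (computed, abs(computed - float(reported)) < 0.015)
    return results


line = "Identity = 136/499 (27.25%), Positives = 246/499 (49.30%), Query Frame = 0"
for label, (percent, agrees) in check_hsp_line(line).items():
    print(label, percent, agrees)
```

For the Q66GQ2 hit, 136 identical residues over a 499-column alignment gives 27.25%, the value reported in the Identity column.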