Tan0006674 (gene) Snake gourd v1

Overview
NameTan0006674
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUPF0503 protein At3g09070, chloroplastic-like
LocationLG05: 3110998 .. 3112719 (+)
RNA-Seq ExpressionTan0006674
SyntenyTan0006674
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATCTGCAGGTCAAAGCTGTATCTCATCGGCTTTCCACTTGTCACCGCCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTACTGCCTCCGTGAACGCCTCGCCGGGATTGATCCCGATACGCGTCAGGAATCGCCTGCTCGGAACCTGCATTCTTCATCGGAGCTCCGTCGGAGTAAATCTTTCTCTGCGGCGAAGCGTGAGGCCGGCATCGGACAACCGGAGGTGCAGCATCGGAAGTCGTGCGATGCTCGCTCCGGGAACTCGTTGTCGGACCTTTTTTGTCGCGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATCGAATCCGAGAACTTAGGTTATGAATTGCGTGAGGTTGTGGCAAATGAGAGGCAATTTAGGGCTTCTGCGGGGGCAATTGGACCGGCTTTGGATACGATCGACGATTTTGCTGGAGAGGATGCTGAGTTCAAGACGATGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAGCGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATCTCGGTGACGATAGCAATGTAGGTGCGGCGAAAACAGAGGCTATCAAGCCGAGAGTGCTTGAAATCAGGGAGACGCGTTCCGAGGTCGGAGAATACGGATTGGGAAGAAGGTCTTGTGATTCAGATCCAAGATTCTCTGTCGATGTAGGTAGAATGTCGTTGGATGATTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGTAGAACTTATCCAAGGCTTACGCCGATGGTTTCAGTTTTGGAGGAGGTCAAATTACCTGGTATTGGATTTGAGAAAGACGGTCCTTCTGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGAAAAGATTCCCGGTGGATCGGCTCAGACTAAAGATTACTATATGGATTCATTGTCTTCTGTAAGGCGGAGGAAGAGTTTTGATCGTTCAAGTTCACACAGAAAAGGGGCCTCAGCGGATTTCGATGACTTGAAATTAATATCAAACGCAAAGGTATCTCCTGCAACCACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAAAGATTTGAAGGACTCACACTCAAAAGCAACAAGAGATGGCGATTTGAGTGGCACTGATGTTACTTCAAAAGATTCTGTTCCCGATGCAGCTGGGATTGATCGAAAGACGTTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATGCAAAAGCGAAGTGGAGAGAGTAAGTCCGATGATGAAGAAAGCAGTGTTGGAGGAAATGTGGTTGATCGGCCTTATGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAATGGAGAAGCAAACGGTTCTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGGTGGATTTAATGGCGGTAATGATTCGAAACTGAACGGTTTGAGGCGGAGAGACGACTTTACGTTGCAGAGGAATCGGAGCGTCAGGTATTCACCAAATAACTTCGATAATGGCTTATTAAGGTTCTACTTGACACCATTGAGAAGCTACAACAGAGGCAAACCAGGAAAGAGCAGGCCAAGAAGTTCTCCTTTCAATGTCAAACATGTCATGTAA

mRNA sequence

ATGAATCTGCAGGTCAAAGCTGTATCTCATCGGCTTTCCACTTGTCACCGCCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTACTGCCTCCGTGAACGCCTCGCCGGGATTGATCCCGATACGCGTCAGGAATCGCCTGCTCGGAACCTGCATTCTTCATCGGAGCTCCGTCGGAGTAAATCTTTCTCTGCGGCGAAGCGTGAGGCCGGCATCGGACAACCGGAGGTGCAGCATCGGAAGTCGTGCGATGCTCGCTCCGGGAACTCGTTGTCGGACCTTTTTTGTCGCGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATCGAATCCGAGAACTTAGGTTATGAATTGCGTGAGGTTGTGGCAAATGAGAGGCAATTTAGGGCTTCTGCGGGGGCAATTGGACCGGCTTTGGATACGATCGACGATTTTGCTGGAGAGGATGCTGAGTTCAAGACGATGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAGCGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATCTCGGTGACGATAGCAATGTAGGTGCGGCGAAAACAGAGGCTATCAAGCCGAGAGTGCTTGAAATCAGGGAGACGCGTTCCGAGGTCGGAGAATACGGATTGGGAAGAAGGTCTTGTGATTCAGATCCAAGATTCTCTGTCGATGTAGGTAGAATGTCGTTGGATGATTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGTAGAACTTATCCAAGGCTTACGCCGATGGTTTCAGTTTTGGAGGAGGTCAAATTACCTGGTATTGGATTTGAGAAAGACGGTCCTTCTGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGAAAAGATTCCCGGTGGATCGGCTCAGACTAAAGATTACTATATGGATTCATTGTCTTCTGTAAGGCGGAGGAAGAGTTTTGATCGTTCAAGTTCACACAGAAAAGGGGCCTCAGCGGATTTCGATGACTTGAAATTAATATCAAACGCAAAGGTATCTCCTGCAACCACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAAAGATTTGAAGGACTCACACTCAAAAGCAACAAGAGATGGCGATTTGAGTGGCACTGATGTTACTTCAAAAGATTCTGTTCCCGATGCAGCTGGGATTGATCGAAAGACGTTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATGCAAAAGCGAAGTGGAGAGAGTAAGTCCGATGATGAAGAAAGCAGTGTTGGAGGAAATGTGGTTGATCGGCCTTATGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAATGGAGAAGCAAACGGTTCTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGGTGGATTTAATGGCGGTAATGATTCGAAACTGAACGGTTTGAGGCGGAGAGACGACTTTACGTTGCAGAGGAATCGGAGCGTCAGGTATTCACCAAATAACTTCGATAATGGCTTATTAAGGTTCTACTTGACACCATTGAGAAGCTACAACAGAGGCAAACCAGGAAAGAGCAGGCCAAGAAGTTCTCCTTTCAATGTCAAACATGTCATGTAA

Coding sequence (CDS)

ATGAATCTGCAGGTCAAAGCTGTATCTCATCGGCTTTCCACTTGTCACCGCCATCCTAGCAAGCCGGTGACCGGATTCTGCGCCTACTGCCTCCGTGAACGCCTCGCCGGGATTGATCCCGATACGCGTCAGGAATCGCCTGCTCGGAACCTGCATTCTTCATCGGAGCTCCGTCGGAGTAAATCTTTCTCTGCGGCGAAGCGTGAGGCCGGCATCGGACAACCGGAGGTGCAGCATCGGAAGTCGTGCGATGCTCGCTCCGGGAACTCGTTGTCGGACCTTTTTTGTCGCGAAGATAAACCGAGATGTACGAATCGGGAGGTGGAGATCGAATCCGAGAACTTAGGTTATGAATTGCGTGAGGTTGTGGCAAATGAGAGGCAATTTAGGGCTTCTGCGGGGGCAATTGGACCGGCTTTGGATACGATCGACGATTTTGCTGGAGAGGATGCTGAGTTCAAGACGATGAAAGAGTTTATAGATCTTGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAGCGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAGAAAATGAAGAATCTCGGTGACGATAGCAATGTAGGTGCGGCGAAAACAGAGGCTATCAAGCCGAGAGTGCTTGAAATCAGGGAGACGCGTTCCGAGGTCGGAGAATACGGATTGGGAAGAAGGTCTTGTGATTCAGATCCAAGATTCTCTGTCGATGTAGGTAGAATGTCGTTGGATGATTCACGGTATTCGTTCGATGAGCCAAGGGCTTCTTGGGATGGGTATCTGATTGGTAGAACTTATCCAAGGCTTACGCCGATGGTTTCAGTTTTGGAGGAGGTCAAATTACCTGGTATTGGATTTGAGAAAGACGGTCCTTCTGATGAAGCAGAAGGGTCTCCGATGAATGTAGGAGAAAAGATTCCCGGTGGATCGGCTCAGACTAAAGATTACTATATGGATTCATTGTCTTCTGTAAGGCGGAGGAAGAGTTTTGATCGTTCAAGTTCACACAGAAAAGGGGCCTCAGCGGATTTCGATGACTTGAAATTAATATCAAACGCAAAGGTATCTCCTGCAACCACAGAGTTGTTCTATGGTGCAAAGGTGCTAATTACAGAGAAAGATTTGAAGGACTCACACTCAAAAGCAACAAGAGATGGCGATTTGAGTGGCACTGATGTTACTTCAAAAGATTCTGTTCCCGATGCAGCTGGGATTGATCGAAAGACGTTCAAGAAGGTGCATAGATGGCGTAAAGTATTAAGTGTTCTGGGTATGATGCAAAAGCGAAGTGGAGAGAGTAAGTCCGATGATGAAGAAAGCAGTGTTGGAGGAAATGTGGTTGATCGGCCTTATGCCGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAATGGAGAAGCAAACGGTTCTGTTAGCCAGAAGCTCATTCGCAGTTACAGCGTAAGCTGTCGAGATCCCAGCAAACTAGGTGGATTTAATGGCGGTAATGATTCGAAACTGAACGGTTTGAGGCGGAGAGACGACTTTACGTTGCAGAGGAATCGGAGCGTCAGGTATTCACCAAATAACTTCGATAATGGCTTATTAAGGTTCTACTTGACACCATTGAGAAGCTACAACAGAGGCAAACCAGGAAAGAGCAGGCCAAGAAGTTCTCCTTTCAATGTCAAACATGTCATGTAA

Protein sequence

MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELREVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGFEKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHVM
Homology
BLAST of Tan0006674 vs. ExPASy Swiss-Prot
Match: Q9SS80 (Protein OCTOPUS OS=Arabidopsis thaliana OX=3702 GN=OPS PE=1 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 1.3e-65
Identity = 225/667 (33.73%), Postives = 313/667 (46.93%), Query Frame = 0

Query: 10  HRLST-CHRHPSKPVTGFCAYCLRERLAGID-------------PDTRQESPARNLHSSS 69
           HRLST C+RHP +  TGFC  CL ERL+ +D             P T   +  + L   S
Sbjct: 25  HRLSTSCNRHPEERFTGFCPSCLCERLSVLDQTNNGGSSSSSKKPPTISAAALKALFKPS 84

Query: 70  ----------------------ELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSD 129
                                 ELRR+KSFSA+K   G        R+SCD R  +SL +
Sbjct: 85  GNNGVGGVNTNGNGRVKPGFFPELRRTKSFSASKNNEGFSGVFEPQRRSCDVRLRSSLWN 144

Query: 130 LFCRED---------------KPRCTN-----------REVEIESENLGYELREVVANER 189
           LF +++               +PR ++            E E + E L  E  E      
Sbjct: 145 LFSQDEQRNLPSNVTGGEIDVEPRKSSVAEPVLEVNDEGEAESDDEELEEEEEEDYVEAG 204

Query: 190 QF----------RASAGAIGPALDTIDDFAG-----EDAEFKTMKEFIDLEFRRKKNAGR 249
            F          R  +  I    + I++         + E K +K++IDL+ + KK +  
Sbjct: 205 DFEILNDSGELMREKSDEIVEVREEIEEAVKPTKGLSEEELKPIKDYIDLDSQTKKPS-- 264

Query: 250 DLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GDDSNVGAAKTEAIKPRVLEIRETRS 309
               +  S W AASVFSKKL KWR+ QKMK    G D   G+A+    KP   ++R+T+S
Sbjct: 265 ----VRRSFWSAASVFSKKLQKWRQNQKMKKRRNGGDHRPGSARLPVEKPIGRQLRDTQS 324

Query: 310 EVGEYGLGRRSCDSDP--------------RFSVDVGRMSLDDSRYSFDEPRASWDGYLI 369
           E+ +YG GRRSCD+DP              RFSVD+GR+SLDD RYSFDEPRASWDG LI
Sbjct: 325 EIADYGYGRRSCDTDPRFSLDAGRFSLDAGRFSVDIGRISLDDPRYSFDEPRASWDGSLI 384

Query: 370 GRTY-------PRLTPMVSVLEEVKLP----------GIGFEKDGPSDEAEGSPMNVGEK 429
           GRT        P    M+SV+E+   P              E+  P          V + 
Sbjct: 385 GRTMFPPAARAPPPPSMLSVVEDAPPPVHRHVTRADMQFPVEEPAPPPPVVNQTNGVSDP 444

Query: 430 --IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH-RKGAS---ADFDDLKLISNAKVSPAT 489
             IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+   AD D+ KL  ++ +S   
Sbjct: 445 VIIPGGSIQTRDYYTD--SSSRRRKSLDRSSSSMRKTAAAVVADMDEPKLSVSSAIS--- 504

Query: 490 TELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKV 549
            + + G+        L+D+++ A    D +G+       + D         KK  RW K 
Sbjct: 505 IDAYSGS--------LRDNNNYAVETAD-NGSFREPAMMIGDRKVNSNDNNKKSRRWGK- 564

Query: 550 LSVLGMMQKRSGESKSDDEESS------VGGNVVDRPYAESWEKLRRVANGEANGSVSQK 555
            S+LG++ ++S     ++EE        + G +V+R  +ESW +LR   NG   G   + 
Sbjct: 565 WSILGLIYRKSVNKYEEEEEEEEDRYRRLNGGMVERSLSESWPELR---NGGGGGGGPRM 624

BLAST of Tan0006674 vs. ExPASy Swiss-Prot
Match: Q9LFB9 (Protein OCTOPUS-like OS=Arabidopsis thaliana OX=3702 GN=OPSL1 PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 2.0e-53
Identity = 203/593 (34.23%), Postives = 280/593 (47.22%), Query Frame = 0

Query: 10  HRLST-CHRHPSKPVTGFCAYCLRERLAGID------PDTRQESP---------ARNLHS 69
           HRLST C  HP +  +GFC  CL +RL+ +D      P +    P         A    S
Sbjct: 23  HRLSTSCDLHPEERFSGFCPSCLCDRLSVLDHNAAPPPSSSSRKPPSISAVSLKALFKPS 82

Query: 70  SS-----------------ELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFC 129
           SS                 ELRR+KSFSA   E   G  E Q R+SCD R  +   +L  
Sbjct: 83  SSGTNNSNGNGRVRPGFFPELRRTKSFSAKNNEGFSGGFEPQ-RRSCDVRLRDDERNLPI 142

Query: 130 RE----DKPRCTNREVEIESENLGY-ELREVVANERQFRASAGAIGPALDTIDDFAGEDA 189
            E    DK     RE  +    L   E  E+  +E       G I    +   +   E+ 
Sbjct: 143 NEAASVDKIEEEARESSVSEIVLEVTEEAEIEEDEENGEKDPGEI--VEEKSSEIGEEEE 202

Query: 190 EFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVG 249
           E K MK+++DL  + KK +   +++ AGS + AASVFSKKL KW++KQK+K     + VG
Sbjct: 203 ELKPMKDYMDLYSQTKKPS---VKDFAGSFFSAASVFSKKLQKWKQKQKVKK--PRNGVG 262

Query: 250 AAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDP-------RFSVDVGRMSLDDSRYSF 309
             + ++                E G+GRRS D+DP       RFSVD+GR+S+DDSRYS 
Sbjct: 263 GGRPQS----------------EIGVGRRSSDTDPRFSLDAGRFSVDIGRISMDDSRYSL 322

Query: 310 DEPRASWDGYLIGRTYPRLTP----MVSVLEEVKLPGIGFE-KDGPSDEAEGSPMNVGEK 369
           DEPRASWDG+LIGRT     P    M+SV+E   L     +    PS +      +    
Sbjct: 323 DEPRASWDGHLIGRTTAARVPLPPSMLSVVENAPLNRSDMQIPSSPSIKPISGDSDPIII 382

Query: 370 IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYG 429
           IPGGS QT+DYY    SS RRRKS DRS+S RK    + +D+K +SN+  +         
Sbjct: 383 IPGGSNQTRDYYTGPPSS-RRRKSLDRSNSIRK-IVTELEDVKSVSNSTTT--------- 442

Query: 430 AKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGM 489
               I    ++ + +K  ++GD                       KK  RW K  S+LG 
Sbjct: 443 ----IDSNSMETAENKGNQNGD-----------------------KKSRRWGK-WSILGF 502

Query: 490 MQKRSGESKSDDEES-SVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYS-VSCR 549
           + ++  + + +D  S S    +V+R  +ESW ++R   NGE  G    K+ RS S VS R
Sbjct: 503 IYRKGKDDEEEDRYSRSNSAGMVERSLSESWPEMR---NGEGGG---PKMRRSNSNVSWR 525

Query: 550 DPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR 551
                   +GG  +              RN+S RYS  + +NG+LRFYLTP+R
Sbjct: 563 S-------SGGGSA--------------RNKSSRYSSKDGENGMLRFYLTPMR 525

BLAST of Tan0006674 vs. NCBI nr
Match: XP_022948149.1 (UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1048.5 bits (2710), Expect = 2.1e-302
Identity = 534/574 (93.03%), Postives = 552/574 (96.17%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRS 60
           MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRS
Sbjct: 1   MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRS 60

Query: 61  KSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELR 120
           KSFSAAKREAGIG+PEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLG+ELR
Sbjct: 61  KSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGFELR 120

Query: 121 EVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 240
           VWEAASVFSKKLGKWRKKQKMKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRR 240

Query: 241 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 300
           SCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF 300

Query: 301 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 360
           EKD PSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSH+KGASADFDD
Sbjct: 301 EKDDPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 420
           LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGT++TSKDSVPDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANG 480
           RKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANG
Sbjct: 421 RKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANG 480

Query: 481 EANGSVSQKLIRSYSVSCRDPSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNF 540
           EANGSVSQKLIRSYSVSCRDPSKL GFN GGNDSKL GLRRRDDFTLQRNRS RYSPNNF
Sbjct: 481 EANGSVSQKLIRSYSVSCRDPSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSARYSPNNF 540

Query: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 572
           DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Sbjct: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 574

BLAST of Tan0006674 vs. NCBI nr
Match: KAG6605428.1 (Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1046.6 bits (2705), Expect = 7.9e-302
Identity = 533/574 (92.86%), Postives = 552/574 (96.17%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRS 60
           MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRS
Sbjct: 1   MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRS 60

Query: 61  KSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELR 120
           KSFSAAKREAGIG+PEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLG+ELR
Sbjct: 61  KSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGFELR 120

Query: 121 EVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 240
           VWEAASVFSKKLGKWRKKQKMKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRR 240

Query: 241 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 300
           SCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF 300

Query: 301 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 360
           EKD P DEAEGSPMNVG+KIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSH+KGASADFDD
Sbjct: 301 EKDDPYDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 420
           LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGT++TSKDSVPDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANG 480
           RKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANG
Sbjct: 421 RKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANG 480

Query: 481 EANGSVSQKLIRSYSVSCRDPSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNF 540
           EANGSVSQKLIRSYSVSCRDPSKL GFN GGNDSKL GLRRRDDFTLQRNRSVRYSPNNF
Sbjct: 481 EANGSVSQKLIRSYSVSCRDPSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSVRYSPNNF 540

Query: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 572
           DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Sbjct: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 574

BLAST of Tan0006674 vs. NCBI nr
Match: XP_023532200.1 (UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1035.4 bits (2676), Expect = 1.8e-298
Identity = 527/575 (91.65%), Postives = 548/575 (95.30%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRS 60
           MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRS
Sbjct: 1   MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRS 60

Query: 61  KSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELR 120
           KSFSAAKRE GIG+PEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLG+ELR
Sbjct: 61  KSFSAAKREVGIGRPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGFELR 120

Query: 121 EVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLRE+AGS
Sbjct: 121 EVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLRELAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 240
           VW AASVFSKKLGKWRKKQKMKNLG+D++VGAAKTEA+KPRVLE+RETRSEVGEYGLGRR
Sbjct: 181 VWGAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAVKPRVLEVRETRSEVGEYGLGRR 240

Query: 241 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 300
           SCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEEAKLPGIGF 300

Query: 301 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 360
           EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSH+KGASADFDD
Sbjct: 301 EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 420
           LKLISNAKVSPATTELFYGAKVLITEKDLKDS SKATRDGDLSGT++TSKD VPDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSQSKATRDGDLSGTNITSKDFVPDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANG 480
           RKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANG
Sbjct: 421 RKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFVAESWEKLRRVANG 480

Query: 481 EANGSVSQKLIRSYSVSCRDPSKLGGFN--GGNDSKLNGLRRRDDFTLQRNRSVRYSPNN 540
           EANGSVSQKLIRSYSVSCRDPSKL GFN  GGNDSKL GLRRRDD TLQRNRSVRYSPNN
Sbjct: 481 EANGSVSQKLIRSYSVSCRDPSKLAGFNGGGGNDSKLYGLRRRDDLTLQRNRSVRYSPNN 540

Query: 541 FDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 572
           FDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Sbjct: 541 FDNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 575

BLAST of Tan0006674 vs. NCBI nr
Match: XP_023007551.1 (UPF0503 protein At3g09070, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1033.5 bits (2671), Expect = 7.0e-298
Identity = 529/574 (92.16%), Postives = 548/574 (95.47%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRS 60
           MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRS
Sbjct: 1   MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRS 60

Query: 61  KSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELR 120
           KSFSAAKREAGIG+PEVQHRKSCDARSG+SLSDLFCREDKPRCT +EVEIESENLG+ELR
Sbjct: 61  KSFSAAKREAGIGRPEVQHRKSCDARSGDSLSDLFCREDKPRCT-KEVEIESENLGFELR 120

Query: 121 EVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 240
           VWEAASVFSKKLGKWRKKQKMKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRR 240

Query: 241 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 300
           SCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEEAKLPGIGF 300

Query: 301 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 360
           EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH+KGASADFDD
Sbjct: 301 EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHKKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 420
           LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGT++TSKDSVPDA GID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDATGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANG 480
           RKTFKKVHRWRKVLSVLGM QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANG
Sbjct: 421 RKTFKKVHRWRKVLSVLGMFQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANG 480

Query: 481 EANGSVSQKLIRSYSVSCRDPSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNF 540
           EANGSVSQKLIRSYSVSCRDPSKL G N GGNDSKL GLRRRDDFTLQRNRSVRYSP NF
Sbjct: 481 EANGSVSQKLIRSYSVSCRDPSKLAGINGGGNDSKLYGLRRRDDFTLQRNRSVRYSPKNF 540

Query: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 572
           DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Sbjct: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 573

BLAST of Tan0006674 vs. NCBI nr
Match: XP_038901013.1 (protein OCTOPUS [Benincasa hispida])

HSP 1 Score: 1009.6 bits (2609), Expect = 1.1e-290
Identity = 514/574 (89.55%), Postives = 533/574 (92.86%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRS 60
           MNLQ+K +SHRLSTCHRHPSKPVTGFCA CLRERLAGID DT+QESP  N HSSSELRRS
Sbjct: 1   MNLQLKTLSHRLSTCHRHPSKPVTGFCASCLRERLAGIDTDTQQESPVPNNHSSSELRRS 60

Query: 61  KSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELR 120
           KS+SAAKREAGI Q EVQHRKSCD RSGNSLSDLFCREDKPRCT REVEIESENLG ELR
Sbjct: 61  KSYSAAKREAGIEQSEVQHRKSCDVRSGNSLSDLFCREDKPRCTIREVEIESENLGSELR 120

Query: 121 EVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVANER FRAS G IGPAL TIDDFAGE+AEFKT+KEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVANERLFRASEGIIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 240
           VWEAASVFSKKLGKWRKKQK KNL ++ NVG  K E IKPRVLEIRETRSEVG+YGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKRKNLSNNGNVGTVKAEDIKPRVLEIRETRSEVGDYGLGRR 240

Query: 241 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 300
           SCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE K  G GF
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF 300

Query: 301 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 360
           EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGAS DFDD
Sbjct: 301 EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSCSHRKGASGDFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 420
           LKLISNAKVSPATTELFYGAKVLITEKDL +SHSKATR+GDLSGTDVTSKDSVPDAA ID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLITEKDLNNSHSKATREGDLSGTDVTSKDSVPDAAVID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEA 480
           RKTFKKVHRWRKVLSVLGM+QKRSGESKSDDEESSVGGNVVDRP AESWEKLRRVANGEA
Sbjct: 421 RKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEA 480

Query: 481 NGSVSQKLIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNG 540
           N  VSQKLIRSYSVSCRDPSKL GFNG NDSKLN  R RDDFTLQRNRSVRYSPNNFDNG
Sbjct: 481 NSCVSQKLIRSYSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNG 540

Query: 541 LLRFYLTPLRSY-NRGKPGKSRPRSSPFNVKHVM 574
           LLRFYLTPLRSY +RGKPGKSRPR+SPFNVKHVM
Sbjct: 541 LLRFYLTPLRSYSSRGKPGKSRPRNSPFNVKHVM 574

BLAST of Tan0006674 vs. ExPASy TrEMBL
Match: A0A6J1G914 (UPF0503 protein At3g09070, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111451814 PE=4 SV=1)

HSP 1 Score: 1048.5 bits (2710), Expect = 1.0e-302
Identity = 534/574 (93.03%), Postives = 552/574 (96.17%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRS 60
           MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRS
Sbjct: 1   MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRS 60

Query: 61  KSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELR 120
           KSFSAAKREAGIG+PEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLG+ELR
Sbjct: 61  KSFSAAKREAGIGRPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGFELR 120

Query: 121 EVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 240
           VWEAASVFSKKLGKWRKKQKMKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRR 240

Query: 241 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 300
           SCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEETKLPGIGF 300

Query: 301 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 360
           EKD PSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLS+VRRRKSFDRSSSH+KGASADFDD
Sbjct: 301 EKDDPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSTVRRRKSFDRSSSHKKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 420
           LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGT++TSKDSVPDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANG 480
           RKTFKKVHRWRKVLSVLGM+QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANG
Sbjct: 421 RKTFKKVHRWRKVLSVLGMLQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANG 480

Query: 481 EANGSVSQKLIRSYSVSCRDPSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNF 540
           EANGSVSQKLIRSYSVSCRDPSKL GFN GGNDSKL GLRRRDDFTLQRNRS RYSPNNF
Sbjct: 481 EANGSVSQKLIRSYSVSCRDPSKLAGFNGGGNDSKLYGLRRRDDFTLQRNRSARYSPNNF 540

Query: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 572
           DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Sbjct: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 574

BLAST of Tan0006674 vs. ExPASy TrEMBL
Match: A0A6J1L0V5 (UPF0503 protein At3g09070, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111500011 PE=4 SV=1)

HSP 1 Score: 1033.5 bits (2671), Expect = 3.4e-298
Identity = 529/574 (92.16%), Postives = 548/574 (95.47%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRS 60
           MNLQ KAVSHRLS+C RHPS+PVTGFCA CLRERLAGID DTRQESP  N HS+SELRRS
Sbjct: 1   MNLQNKAVSHRLSSCRRHPSEPVTGFCASCLRERLAGIDSDTRQESPVLNQHSTSELRRS 60

Query: 61  KSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELR 120
           KSFSAAKREAGIG+PEVQHRKSCDARSG+SLSDLFCREDKPRCT +EVEIESENLG+ELR
Sbjct: 61  KSFSAAKREAGIGRPEVQHRKSCDARSGDSLSDLFCREDKPRCT-KEVEIESENLGFELR 120

Query: 121 EVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVA ERQFRAS G IGPALD IDDF+GEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVAKERQFRASEGVIGPALDIIDDFSGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 240
           VWEAASVFSKKLGKWRKKQKMKNLG+D++VGAAKTEAIKPRVLE+RETRSEVGEYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKMKNLGNDTDVGAAKTEAIKPRVLEVRETRSEVGEYGLGRR 240

Query: 241 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 300
           SCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRT+PR TPMVSVLEE KLPGIGF
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTHPRPTPMVSVLEEAKLPGIGF 300

Query: 301 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 360
           EKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH+KGASADFDD
Sbjct: 301 EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHKKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 420
           LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGT++TSKDSVPDA GID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTNITSKDSVPDATGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMQKRSGE-SKSDDEESSVGGNVVDRPY-AESWEKLRRVANG 480
           RKTFKKVHRWRKVLSVLGM QKRSGE SKSDDEES VGGNVVDRP+ AESWEKLRRVANG
Sbjct: 421 RKTFKKVHRWRKVLSVLGMFQKRSGESSKSDDEESCVGGNVVDRPFAAESWEKLRRVANG 480

Query: 481 EANGSVSQKLIRSYSVSCRDPSKLGGFN-GGNDSKLNGLRRRDDFTLQRNRSVRYSPNNF 540
           EANGSVSQKLIRSYSVSCRDPSKL G N GGNDSKL GLRRRDDFTLQRNRSVRYSP NF
Sbjct: 481 EANGSVSQKLIRSYSVSCRDPSKLAGINGGGNDSKLYGLRRRDDFTLQRNRSVRYSPKNF 540

Query: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 572
           DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH
Sbjct: 541 DNGLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKH 573

BLAST of Tan0006674 vs. ExPASy TrEMBL
Match: A0A6J1D4T1 (UPF0503 protein At3g09070, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111017017 PE=4 SV=1)

HSP 1 Score: 998.0 bits (2579), Expect = 1.6e-287
Identity = 504/572 (88.11%), Postives = 524/572 (91.61%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRS 60
           MNL  K V HRLSTC RHPSKPVTGFCAYCLRERLAGIDPDTRQE+P RN HSSSELRRS
Sbjct: 1   MNLHAKTVPHRLSTCQRHPSKPVTGFCAYCLRERLAGIDPDTRQETPPRNQHSSSELRRS 60

Query: 61  KSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELR 120
           KSFSAAKR+AGIGQPEVQHRKSCD RSGNSLSDLFCRED+P+C N+EVEIESENLG+ELR
Sbjct: 61  KSFSAAKRDAGIGQPEVQHRKSCDVRSGNSLSDLFCREDRPKCPNQEVEIESENLGFELR 120

Query: 121 EVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAGS 180
           EV ANERQFRAS GAIGP LDTIDDFAG +AEFKTMKEFIDLE RRKKN GRDLREIAGS
Sbjct: 121 EVAANERQFRASEGAIGPPLDTIDDFAGGEAEFKTMKEFIDLEIRRKKNTGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 240
           VWEAASV SKKLGKWRKKQKMKNLG+ SN G  KTE  KPR+LE RETRSEVGEYGLGRR
Sbjct: 181 VWEAASVISKKLGKWRKKQKMKNLGNSSNAGTVKTEGFKPRMLETRETRSEVGEYGLGRR 240

Query: 241 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 300
           SCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIGRTYPRL PMVSVLEEVK PG GF
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLAPMVSVLEEVKFPGNGF 300

Query: 301 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 360
           E   P DEAEG  MNVG+KIPGGSAQTKDYYM+SLSS+RRRKSFDRSSSHRKGASADFDD
Sbjct: 301 ENGDPPDEAEGPLMNVGDKIPGGSAQTKDYYMESLSSLRRRKSFDRSSSHRKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 420
           LK ISNAKVSPATTELFYGAKVLITEKDL DSHSK+TRDGDLSG++VTSKDSVPDAAG D
Sbjct: 361 LKSISNAKVSPATTELFYGAKVLITEKDLNDSHSKSTRDGDLSGSEVTSKDSVPDAAGSD 420

Query: 421 RKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEA 480
           RKTFKK +RW+KVL VLGMMQKRS ESKSDDEE  VG N VDRP AESWEKLRRVANGEA
Sbjct: 421 RKTFKKAYRWQKVLKVLGMMQKRS-ESKSDDEEGCVGVNSVDRPLAESWEKLRRVANGEA 480

Query: 481 NGSVSQKLIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNG 540
           N SVSQKLIRSYSVSCRDP+KL GFNGGND KLNGLRRRDD TLQRNRSVRYSPNNFDNG
Sbjct: 481 NCSVSQKLIRSYSVSCRDPNKLAGFNGGNDLKLNGLRRRDDLTLQRNRSVRYSPNNFDNG 540

Query: 541 LLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHV 573
           LLRFYLTPLRSY+RGKPGKSRPRSSPFNVKHV
Sbjct: 541 LLRFYLTPLRSYSRGKPGKSRPRSSPFNVKHV 571

BLAST of Tan0006674 vs. ExPASy TrEMBL
Match: A0A5A7V3J1 (UPF0503 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold571G00450 PE=4 SV=1)

HSP 1 Score: 993.0 bits (2566), Expect = 5.0e-286
Identity = 505/574 (87.98%), Postives = 526/574 (91.64%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSS-ELRR 60
           MNLQ+K+VSHRLSTCHRHPSKPVTGFCA CLRERLAGIDPD + ESP  N HSSS ELRR
Sbjct: 1   MNLQLKSVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDLQHESPLPNNHSSSAELRR 60

Query: 61  SKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYEL 120
           SKS+SAAK EAGIGQ E+QHRKSCD RSGNSLSDLFCREDKPRCTN EVEIESENLG+EL
Sbjct: 61  SKSYSAAKCEAGIGQSELQHRKSCDVRSGNSLSDLFCREDKPRCTNPEVEIESENLGFEL 120

Query: 121 REVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAG 180
           REVV N RQFRAS G IGP L TIDDFAGEDAEFKT+KEFIDLEFRRKKNAGRDLREIAG
Sbjct: 121 REVVGNGRQFRASEGIIGPGLGTIDDFAGEDAEFKTVKEFIDLEFRRKKNAGRDLREIAG 180

Query: 181 SVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGR 240
           SVWEAASVFSKKL KWRKKQK KNLG++SNVGA K E IKPR LEIRETRSEVGEYGLGR
Sbjct: 181 SVWEAASVFSKKLSKWRKKQKRKNLGNNSNVGAVKVEDIKPRALEIRETRSEVGEYGLGR 240

Query: 241 RSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIG 300
           RSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE K  G G
Sbjct: 241 RSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGAG 300

Query: 301 FEKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFD 360
           FEKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGAS DFD
Sbjct: 301 FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSCSHRKGASGDFD 360

Query: 361 DLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGI 420
           +LKLISNAKVSPATTELFYGAKVLITEKDL  S  KAT DGDLSGTDVTSKDSVPDA  I
Sbjct: 361 ELKLISNAKVSPATTELFYGAKVLITEKDLNSSRPKATGDGDLSGTDVTSKDSVPDAPVI 420

Query: 421 DRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGE 480
           DRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP  ESWEKLRRVANGE
Sbjct: 421 DRKSFKKVHRWRKVLSVLGMIQKRNGESKSDDEESSVAGNVVDRPVVESWEKLRRVANGE 480

Query: 481 ANGSVSQKLIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDN 540
           AN  VSQKLIRSYSVSCRDPSKL GFNGGNDSKLN  R RDDFTLQRNRSVRYSPNNFDN
Sbjct: 481 ANSCVSQKLIRSYSVSCRDPSKLAGFNGGNDSKLNVTRWRDDFTLQRNRSVRYSPNNFDN 540

Query: 541 GLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHVM 574
           GLLRFYLTPLRSY+RGK GKSRPR+SPFNVKHV+
Sbjct: 541 GLLRFYLTPLRSYSRGKLGKSRPRNSPFNVKHVI 574

BLAST of Tan0006674 vs. ExPASy TrEMBL
Match: A0A1S3CKL0 (UPF0503 protein At3g09070, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502039 PE=4 SV=1)

HSP 1 Score: 993.0 bits (2566), Expect = 5.0e-286
Identity = 505/574 (87.98%), Postives = 526/574 (91.64%), Query Frame = 0

Query: 1   MNLQVKAVSHRLSTCHRHPSKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSS-ELRR 60
           MNLQ+K+VSHRLSTCHRHPSKPVTGFCA CLRERLAGIDPD + ESP  N HSSS ELRR
Sbjct: 1   MNLQLKSVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDLQHESPLPNNHSSSAELRR 60

Query: 61  SKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYEL 120
           SKS+SAAK EAGIGQ E+QHRKSCD RSGNSLSDLFCREDKPRCTN EVEIESENLG+EL
Sbjct: 61  SKSYSAAKCEAGIGQSELQHRKSCDVRSGNSLSDLFCREDKPRCTNPEVEIESENLGFEL 120

Query: 121 REVVANERQFRASAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRKKNAGRDLREIAG 180
           REVV N RQFRAS G IGP L TIDDFAGEDAEFKT+KEFIDLEFRRKKNAGRDLREIAG
Sbjct: 121 REVVGNGRQFRASEGIIGPGLGTIDDFAGEDAEFKTVKEFIDLEFRRKKNAGRDLREIAG 180

Query: 181 SVWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGR 240
           SVWEAASVFSKKL KWRKKQK KNLG++SNVGA K E IKPR LEIRETRSEVGEYGLGR
Sbjct: 181 SVWEAASVFSKKLSKWRKKQKRKNLGNNSNVGAVKVEDIKPRALEIRETRSEVGEYGLGR 240

Query: 241 RSCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIG 300
           RSCD+DPRFSVD GRMSLDDSRYSFDEPRASWDGYLIG+TYPR+TPMVSVLEE K  G G
Sbjct: 241 RSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGAG 300

Query: 301 FEKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFD 360
           FEKD PSDEAEGSPMNVG+KIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGAS DFD
Sbjct: 301 FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSCSHRKGASGDFD 360

Query: 361 DLKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGI 420
           +LKLISNAKVSPATTELFYGAKVLITEKDL  S  KAT DGDLSGTDVTSKDSVPDA  I
Sbjct: 361 ELKLISNAKVSPATTELFYGAKVLITEKDLNSSRPKATGDGDLSGTDVTSKDSVPDAPVI 420

Query: 421 DRKTFKKVHRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGE 480
           DRK+FKKVHRWRKVLSVLGM+QKR+GESKSDDEESSV GNVVDRP  ESWEKLRRVANGE
Sbjct: 421 DRKSFKKVHRWRKVLSVLGMIQKRNGESKSDDEESSVAGNVVDRPVVESWEKLRRVANGE 480

Query: 481 ANGSVSQKLIRSYSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDN 540
           AN  VSQKLIRSYSVSCRDPSKL GFNGGNDSKLN  R RDDFTLQRNRSVRYSPNNFDN
Sbjct: 481 ANSCVSQKLIRSYSVSCRDPSKLAGFNGGNDSKLNVTRWRDDFTLQRNRSVRYSPNNFDN 540

Query: 541 GLLRFYLTPLRSYNRGKPGKSRPRSSPFNVKHVM 574
           GLLRFYLTPLRSY+RGK GKSRPR+SPFNVKHV+
Sbjct: 541 GLLRFYLTPLRSYSRGKLGKSRPRNSPFNVKHVI 574

BLAST of Tan0006674 vs. TAIR 10
Match: AT5G58930.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 280.0 bits (715), Expect = 4.2e-75
Identity = 219/593 (36.93%), Postives = 303/593 (51.10%), Query Frame = 0

Query: 13  STCHRHP-SKPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAG 72
           + CHRHP SKP TGFCA CLRERL+ I      E+ + ++ +S+ELRR +S+S   R+A 
Sbjct: 15  AVCHRHPSSKPTTGFCATCLRERLSTI------EALSSSVSASTELRRVRSYSV--RDAS 74

Query: 73  IGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTNREVEIESENLGYELREVVANERQFRA 132
               +   R+SCD RS +   D             + E+   ++ + +   +  + +   
Sbjct: 75  ASVLDQPRRRSCDVRSNHDDDD-------------DDELLKSSIRFPIVPDLIEDEEEED 134

Query: 133 SAGAIGPALDTIDDFAGEDAEFKTMKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFS 192
             G        + +   ED E KTMKE IDLE R +  KN G+D            SVFS
Sbjct: 135 DEG------KKLVEEEIEDGEQKTMKELIDLESRNQQLKNNGKD------------SVFS 194

Query: 193 KKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDPRFS 252
           + L K+  K   K + D  N                           LGRRSCD DPR S
Sbjct: 195 RTLRKFSLKHHRK-IPDSGN--------------------------SLGRRSCDVDPRLS 254

Query: 253 VDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGFEKDGPSDEA 312
           +D GR+       SFDEPRASWDG LIG+TYP+L P+ SV E+VK            ++ 
Sbjct: 255 LDAGRV-------SFDEPRASWDGCLIGKTYPKLIPLSSVTEDVK---------ASPEKI 314

Query: 313 EGSPMNVGEK-IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAK 372
            G  +   EK  PGG+AQT+DYY+DS    RRR+SFDRSS H      + D+LK ISNAK
Sbjct: 315 TGEKVEEDEKNNPGGTAQTRDYYLDS----RRRRSFDRSSRH---GLLEVDELKAISNAK 374

Query: 373 VSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRK-----T 432
           VSP T  LF+GAK+L+TE++L+DS+  + ++      ++ SK     AAG  +K      
Sbjct: 375 VSPETVGLFHGAKLLVTERELRDSNWYSIKNYKPESLELGSKGVGCVAAGEVKKQDGFGL 434

Query: 433 FKKVHRWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPYAESWEKLRRVANGEA 492
            K    W K  +  G++Q+++  +K++   ++   +GGN ++   AES  KLRRVA GE 
Sbjct: 435 KKSGKNWSKGWNFWGLIQRKTDVAKNEMKTEQSLKLGGNTMEGSLAESLLKLRRVAKGET 494

Query: 493 NGSVSQKLIRSYSVSC--------RDPSKLGGFNGG-----------------------N 552
           NG VS+KLIRSYSVS         R  S + GF GG                        
Sbjct: 495 NGDVSEKLIRSYSVSARKSCDGMLRGASIVNGFEGGRSSCDGLFHGSITGVETGRRSLCE 518

Query: 553 DSKLNGLRRRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYNRGKPGKSR 562
           D   +G+  + +  LQ +  +  YSP+N  NG++RFYLTPL S+   K GKSR
Sbjct: 555 DGMFHGVEGKRNHLLQSDDKLGTYSPDNLRNGMVRFYLTPLNSHMTSKSGKSR 518

BLAST of Tan0006674 vs. TAIR 10
Match: AT3G46990.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 256.9 bits (655), Expect = 3.8e-68
Identity = 211/601 (35.11%), Postives = 303/601 (50.42%), Query Frame = 0

Query: 13  STCHRHPS-KPVTGFCAYCLRERLAGIDPDTRQESPARNLHSSSELRRSKSFSAAKREAG 72
           S+CHRHPS KP +GFCA CLRERL  I+     +S +     + ELRR +S+S   R A 
Sbjct: 14  SSCHRHPSAKPTSGFCASCLRERLVTIE----AQSSSLAAVQTPELRRIRSYSV--RNAS 73

Query: 73  IGQPEVQHRKSCDAR-SGNSLSDLFCREDKPRCTNREVEIESENLGYELREVVANERQFR 132
           +   +   R+SCD R S +SL DLF  +D+ R  +   +    +L  E  E    E  + 
Sbjct: 74  VSVSDQPRRRSCDVRSSASSLLDLFVDDDEERVDSSIRKPLVPDLKEEEEEEEEEEDYYD 133

Query: 133 ASAGAIGPALDTIDDFAGED--------AEFKTMKEFIDLEFRR--KKNAGRDLREIAGS 192
                        +D  G D         E KTMKEFIDL++R   KKN G+DL+EI   
Sbjct: 134 G------------EDIKGFDEGKPRKIVEENKTMKEFIDLDWRNQIKKNNGKDLKEI--- 193

Query: 193 VWEAASVFSKKLGKWRKKQKMKNLGDDSNVGAAKTEAIKPRVLEIRETRSEVGEYGLGRR 252
               ASV S++L  +   ++     D    G                          GR 
Sbjct: 194 ----ASVLSRRLKNFTLNKRNDEKSDSRFAGIVN-----------------------GRH 253

Query: 253 SCDSDPRFSVDVGRMSLDDSRYSFDEPRASWDGYLIGRTYPRLTPMVSVLEEVKLPGIGF 312
           S D DPR S D GR+       SF++PR+SWDG LI ++Y +LT + +V E+ K    G 
Sbjct: 254 SSDVDPRLSFDGGRI-------SFEKPRSSWDGCLIEKSYHKLTTLSTVTEDAKAK-CGV 313

Query: 313 EKDGPSDEAEGSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDD 372
           E++   ++         EK PGG+ QTK+YY DS    RRR+SFDRS S ++    + D+
Sbjct: 314 EEEEVEEK---------EKSPGGTVQTKNYYSDS----RRRRSFDRSVSIKRQGLLEVDE 373

Query: 373 LKLISNAKVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGID 432
           L+ ISNAKVSP T  LF+GAK+L+TEK+L+DS+  + ++      ++ SK  +  AAG +
Sbjct: 374 LRGISNAKVSPETVGLFHGAKLLVTEKELRDSNWYSIKNVKPESKELVSKGKICIAAGGE 433

Query: 433 RKTFKKVH------RWRKVLSVLGMMQKRSGESKSD---DEESSVGGNVVDRPYAESWEK 492
            K    V       +W K  ++ G++Q R  E+K++   ++   + GN V+   AES  K
Sbjct: 434 GKKQDSVELKKPRKKWPKGWNIWGLIQ-RKNEAKNEIKTEQILKLEGNAVEGSLAESLLK 493

Query: 493 LRRVANGEANGSVSQKLIRSYSVSCR--------DPSKLGGFNGGN-------------- 552
           LRRV  GE N  VS+KL++SYSVS R          + + GF GG               
Sbjct: 494 LRRVGKGETNVGVSEKLLKSYSVSARKSCDGVRSGANIVSGFEGGRSSCDGLFHGSINSV 544

Query: 553 -------DSKLNGLR-RRDDFTLQRNRSV-RYSPNNFDNGLLRFYLTPLRSYNRGKPGKS 562
                  D  +NG+  +++   LQRN +V   S  N +  + RFYL+P++S+   K GKS
Sbjct: 554 EAGRNSCDGLVNGIEGKQNHHLLQRNANVGTCSQENLEKSMFRFYLSPVKSHKTSKSGKS 544

BLAST of Tan0006674 vs. TAIR 10
Match: AT2G38070.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 256.1 bits (653), Expect = 6.5e-68
Identity = 223/620 (35.97%), Postives = 310/620 (50.00%), Query Frame = 0

Query: 10  HRLST-CHRHPSKPVTGFCAYCLRERLAGIDPDTRQE----SPARNLHSSS--------- 69
           HR ST C RHP +  TGFC  CL +RL+ +D   +      S ++   SSS         
Sbjct: 32  HRPSTSCDRHPDERFTGFCPSCLFDRLSVLDITGKNNNAVASSSKKPPSSSAALKAIFKP 91

Query: 70  ---------ELRRSKSFSAAKREA-GIGQPEVQHRKSCDARSGNSLSDLFCREDKPRCTN 129
                    ELRR+KSFSA+K EA  +G  E Q R+SCD R  N+L  LF  + +     
Sbjct: 92  SSSSGSFFPELRRTKSFSASKAEAFSLGAFEPQ-RRSCDVRVRNTLWSLFHEDAEHNSQT 151

Query: 130 RE------VEIESENLG-----------YELREVVANERQFRASAGAIGPALDTIDDFAG 189
           +E       EI+ E +             E+     NE+  +            ID+   
Sbjct: 152 KEGLSVNCSEIDLERINSIVKSPVFEEETEIESEQDNEKDIKFE--TFKEPRSVIDEIVE 211

Query: 190 EDAEFKTMK-EFIDLEFRRK---KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNL 249
           E+ E +T K E   +EF  +   K   RD +EIAGS W AASVFSKKL KWR+KQK+K  
Sbjct: 212 EEEEEETKKVEDFTMEFNPQTTAKKTNRDFKEIAGSFWSAASVFSKKLQKWRQKQKLKK- 271

Query: 250 GDDSNVGAAKTEAIKPRVL--EIRETRSEVGEYGLGRRSCDSDPRFSVDVGRMSL----- 309
               N+GA  +     + +  ++R+T+SE+ EYG GRRSCD+DPRFS+D GR SL     
Sbjct: 272 HRTGNLGAGSSALPVEKAIGRQLRDTQSEIAEYGYGRRSCDTDPRFSIDAGRFSLDAGRV 331

Query: 310 --DDSRYSFDEPRASWDGYLIGRTYP--RLTPMVSVLEEVKLPGIGFEKDG--PSDEA-E 369
             DD RYSF+EPRASWDGYLIGR     R+  M+SV+E+  +       D   P +++ +
Sbjct: 332 SVDDPRYSFEEPRASWDGYLIGRAAAPMRMPSMLSVVEDSPVRNHVHRSDTHIPVEKSPQ 391

Query: 370 GSPMNVGEKIPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRK---GASADFDDLKLISNA 429
            S   + E +PGGSAQT++YY+DS SS RRRKS DRSSS RK      A+ D+LKL  + 
Sbjct: 392 VSEAVIDEIVPGGSAQTREYYLDS-SSSRRRKSLDRSSSTRKLSASVMAEIDELKLTQDR 451

Query: 430 KVSPATTELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKV 489
           +                  KDL  SHS + RD D    +   +  V +  G      K+ 
Sbjct: 452 EA-----------------KDLV-SHSNSLRD-DCCSVENNYEMGVRENVGTIECNKKRT 511

Query: 490 HRWRKVLSVLGMMQKRSGESKSDDEESSVGGNVVDRPYAESWEKLRRVANGEANGSVSQK 549
            + R   ++ G++ +++G +K ++EE   G   VDR ++ SW       N E       K
Sbjct: 512 KKSRWSWNIFGLLHRKNG-NKYEEEERRSG---VDRTFSGSW-------NVEPRNGFDPK 571

Query: 550 LIRS-YSVSCRDPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYL 567
           +IRS  SVS R     GG          GL+R    ++    S +   +  +NG+L+FYL
Sbjct: 572 MIRSNSSVSWRSSGTTGG----------GLQRN---SVDGYISGKKKVSKAENGMLKFYL 603

BLAST of Tan0006674 vs. TAIR 10
Match: AT3G09070.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 252.3 bits (643), Expect = 9.4e-67
Identity = 225/667 (33.73%), Postives = 313/667 (46.93%), Query Frame = 0

Query: 10  HRLST-CHRHPSKPVTGFCAYCLRERLAGID-------------PDTRQESPARNLHSSS 69
           HRLST C+RHP +  TGFC  CL ERL+ +D             P T   +  + L   S
Sbjct: 25  HRLSTSCNRHPEERFTGFCPSCLCERLSVLDQTNNGGSSSSSKKPPTISAAALKALFKPS 84

Query: 70  ----------------------ELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSD 129
                                 ELRR+KSFSA+K   G        R+SCD R  +SL +
Sbjct: 85  GNNGVGGVNTNGNGRVKPGFFPELRRTKSFSASKNNEGFSGVFEPQRRSCDVRLRSSLWN 144

Query: 130 LFCRED---------------KPRCTN-----------REVEIESENLGYELREVVANER 189
           LF +++               +PR ++            E E + E L  E  E      
Sbjct: 145 LFSQDEQRNLPSNVTGGEIDVEPRKSSVAEPVLEVNDEGEAESDDEELEEEEEEDYVEAG 204

Query: 190 QF----------RASAGAIGPALDTIDDFAG-----EDAEFKTMKEFIDLEFRRKKNAGR 249
            F          R  +  I    + I++         + E K +K++IDL+ + KK +  
Sbjct: 205 DFEILNDSGELMREKSDEIVEVREEIEEAVKPTKGLSEEELKPIKDYIDLDSQTKKPS-- 264

Query: 250 DLREIAGSVWEAASVFSKKLGKWRKKQKMKNL--GDDSNVGAAKTEAIKPRVLEIRETRS 309
               +  S W AASVFSKKL KWR+ QKMK    G D   G+A+    KP   ++R+T+S
Sbjct: 265 ----VRRSFWSAASVFSKKLQKWRQNQKMKKRRNGGDHRPGSARLPVEKPIGRQLRDTQS 324

Query: 310 EVGEYGLGRRSCDSDP--------------RFSVDVGRMSLDDSRYSFDEPRASWDGYLI 369
           E+ +YG GRRSCD+DP              RFSVD+GR+SLDD RYSFDEPRASWDG LI
Sbjct: 325 EIADYGYGRRSCDTDPRFSLDAGRFSLDAGRFSVDIGRISLDDPRYSFDEPRASWDGSLI 384

Query: 370 GRTY-------PRLTPMVSVLEEVKLP----------GIGFEKDGPSDEAEGSPMNVGEK 429
           GRT        P    M+SV+E+   P              E+  P          V + 
Sbjct: 385 GRTMFPPAARAPPPPSMLSVVEDAPPPVHRHVTRADMQFPVEEPAPPPPVVNQTNGVSDP 444

Query: 430 --IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSH-RKGAS---ADFDDLKLISNAKVSPAT 489
             IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+   AD D+ KL  ++ +S   
Sbjct: 445 VIIPGGSIQTRDYYTD--SSSRRRKSLDRSSSSMRKTAAAVVADMDEPKLSVSSAIS--- 504

Query: 490 TELFYGAKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKV 549
            + + G+        L+D+++ A    D +G+       + D         KK  RW K 
Sbjct: 505 IDAYSGS--------LRDNNNYAVETAD-NGSFREPAMMIGDRKVNSNDNNKKSRRWGK- 564

Query: 550 LSVLGMMQKRSGESKSDDEESS------VGGNVVDRPYAESWEKLRRVANGEANGSVSQK 555
            S+LG++ ++S     ++EE        + G +V+R  +ESW +LR   NG   G   + 
Sbjct: 565 WSILGLIYRKSVNKYEEEEEEEEDRYRRLNGGMVERSLSESWPELR---NGGGGGGGPRM 624

BLAST of Tan0006674 vs. TAIR 10
Match: AT5G01170.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 211.8 bits (538), Expect = 1.4e-54
Identity = 203/593 (34.23%), Postives = 280/593 (47.22%), Query Frame = 0

Query: 10  HRLST-CHRHPSKPVTGFCAYCLRERLAGID------PDTRQESP---------ARNLHS 69
           HRLST C  HP +  +GFC  CL +RL+ +D      P +    P         A    S
Sbjct: 23  HRLSTSCDLHPEERFSGFCPSCLCDRLSVLDHNAAPPPSSSSRKPPSISAVSLKALFKPS 82

Query: 70  SS-----------------ELRRSKSFSAAKREAGIGQPEVQHRKSCDARSGNSLSDLFC 129
           SS                 ELRR+KSFSA   E   G  E Q R+SCD R  +   +L  
Sbjct: 83  SSGTNNSNGNGRVRPGFFPELRRTKSFSAKNNEGFSGGFEPQ-RRSCDVRLRDDERNLPI 142

Query: 130 RE----DKPRCTNREVEIESENLGY-ELREVVANERQFRASAGAIGPALDTIDDFAGEDA 189
            E    DK     RE  +    L   E  E+  +E       G I    +   +   E+ 
Sbjct: 143 NEAASVDKIEEEARESSVSEIVLEVTEEAEIEEDEENGEKDPGEI--VEEKSSEIGEEEE 202

Query: 190 EFKTMKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKMKNLGDDSNVG 249
           E K MK+++DL  + KK +   +++ AGS + AASVFSKKL KW++KQK+K     + VG
Sbjct: 203 ELKPMKDYMDLYSQTKKPS---VKDFAGSFFSAASVFSKKLQKWKQKQKVKK--PRNGVG 262

Query: 250 AAKTEAIKPRVLEIRETRSEVGEYGLGRRSCDSDP-------RFSVDVGRMSLDDSRYSF 309
             + ++                E G+GRRS D+DP       RFSVD+GR+S+DDSRYS 
Sbjct: 263 GGRPQS----------------EIGVGRRSSDTDPRFSLDAGRFSVDIGRISMDDSRYSL 322

Query: 310 DEPRASWDGYLIGRTYPRLTP----MVSVLEEVKLPGIGFE-KDGPSDEAEGSPMNVGEK 369
           DEPRASWDG+LIGRT     P    M+SV+E   L     +    PS +      +    
Sbjct: 323 DEPRASWDGHLIGRTTAARVPLPPSMLSVVENAPLNRSDMQIPSSPSIKPISGDSDPIII 382

Query: 370 IPGGSAQTKDYYMDSLSSVRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYG 429
           IPGGS QT+DYY    SS RRRKS DRS+S RK    + +D+K +SN+  +         
Sbjct: 383 IPGGSNQTRDYYTGPPSS-RRRKSLDRSNSIRK-IVTELEDVKSVSNSTTT--------- 442

Query: 430 AKVLITEKDLKDSHSKATRDGDLSGTDVTSKDSVPDAAGIDRKTFKKVHRWRKVLSVLGM 489
               I    ++ + +K  ++GD                       KK  RW K  S+LG 
Sbjct: 443 ----IDSNSMETAENKGNQNGD-----------------------KKSRRWGK-WSILGF 502

Query: 490 MQKRSGESKSDDEES-SVGGNVVDRPYAESWEKLRRVANGEANGSVSQKLIRSYS-VSCR 549
           + ++  + + +D  S S    +V+R  +ESW ++R   NGE  G    K+ RS S VS R
Sbjct: 503 IYRKGKDDEEEDRYSRSNSAGMVERSLSESWPEMR---NGEGGG---PKMRRSNSNVSWR 525

Query: 550 DPSKLGGFNGGNDSKLNGLRRRDDFTLQRNRSVRYSPNNFDNGLLRFYLTPLR 551
                   +GG  +              RN+S RYS  + +NG+LRFYLTP+R
Sbjct: 563 S-------SGGGSA--------------RNKSSRYSSKDGENGMLRFYLTPMR 525

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SS801.3e-6533.73Protein OCTOPUS OS=Arabidopsis thaliana OX=3702 GN=OPS PE=1 SV=1[more]
Q9LFB92.0e-5334.23Protein OCTOPUS-like OS=Arabidopsis thaliana OX=3702 GN=OPSL1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
XP_022948149.12.1e-30293.03UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata][more]
KAG6605428.17.9e-30292.86Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_023532200.11.8e-29891.65UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo][more]
XP_023007551.17.0e-29892.16UPF0503 protein At3g09070, chloroplastic-like [Cucurbita maxima][more]
XP_038901013.11.1e-29089.55protein OCTOPUS [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1G9141.0e-30293.03UPF0503 protein At3g09070, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=L... [more]
A0A6J1L0V53.4e-29892.16UPF0503 protein At3g09070, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC... [more]
A0A6J1D4T11.6e-28788.11UPF0503 protein At3g09070, chloroplastic OS=Momordica charantia OX=3673 GN=LOC11... [more]
A0A5A7V3J15.0e-28687.98UPF0503 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold571G0045... [more]
A0A1S3CKL05.0e-28687.98UPF0503 protein At3g09070, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502039... [more]
Match NameE-valueIdentityDescription
AT5G58930.14.2e-7536.93Protein of unknown function (DUF740) [more]
AT3G46990.13.8e-6835.11Protein of unknown function (DUF740) [more]
AT2G38070.16.5e-6835.97Protein of unknown function (DUF740) [more]
AT3G09070.19.4e-6733.73Protein of unknown function (DUF740) [more]
AT5G01170.11.4e-5434.23Protein of unknown function (DUF740) [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008004Protein OCTOPUS-likePFAMPF05340DUF740coord: 147..552
e-value: 4.9E-130
score: 435.1
coord: 8..50
e-value: 9.6E-10
score: 37.7
coord: 51..113
e-value: 2.2E-8
score: 33.3
IPR008004Protein OCTOPUS-likePANTHERPTHR31659PROTEIN: UPF0503-LIKE PROTEIN, PUTATIVE (DUF740)-RELATEDcoord: 4..572
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 40..88
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 299..328
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 390..415
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 552..573
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 441..460
NoneNo IPR availablePANTHERPTHR31659:SF0EMB|CAB61945.1coord: 4..572

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0006674.1Tan0006674.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005886 plasma membrane