Sgr020857 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr020857
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: U4/U6.U5 tri-snRNP-associated protein 2-like
Location: tig00153574: 653139 .. 656917 (-)
RNA-Seq Expression: Sgr020857
Synteny: Sgr020857
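
The location field above packs contig, coordinates, and strand into one string. A minimal Python sketch for splitting it apart follows; the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "contig": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("tig00153574: 653139 .. 656917 (-)")
print(loc["contig"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 3779 bases on the minus strand of contig tig00153574.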
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGATCAAAGAGGCGAGATGATAATGACGTAGATGAGGAAGAGTTAGCTCCAGTGTTAAAGAGGCAAAAATTACTTGGGGAGTTCTCACCTTCTTCTTCTCCACCTGCCTCAGAGAACACTCGGCTTCCTGGTTTTAACTATGGTGATGATGATGATGAAGAAGATTACAAATTTAAACAAAATGGAAGTAGATATGATAGAGATGAAGGGGATGACAATGATGAGGAGGAAGATGATGAACAGAATGATGATGATGCAAGTCATGTAAAGCGAAGCCGTGATGTTGAAGTTCGGAAAGATTGTCCTTATCTGGATACTGTAAACCGACAGGTGCATAGCTTGTTTCTTTTCTGTATATTCATTCTGCATTTCTGCATCTCAGTGCTAAGGCCATTTGTTATTGCAACAAGTTCGTTACATGCCGCTTGCTACTTTGTAAATTTTTGTTCCTTCTAGTGAATTCATATGTGTTGATGCTTTGCAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTGAATGTTTATGCCTGCCTCGTATGTGGCAAGTACTATCAAGGGAGGGGGAAGAAATCTCATGCTTACACTCACAGTCTCGAAGCAGGACACCATGTCTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCTGATGGATATGAGATTAATGACCCCTCATTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCTTATACTTGATCCATTATTTGATTGAAACTTTTGTCTTGTCACCATGGATCGGATCTTGAGGCATTCCTCCTTTTGGAGATGCTAGTTTGCTGTTGTTTGATGTTGGATGAGGTGCTCTTCCACCCTTCGTTTCGGGATAAGGGTCATATTTTTTGGTGCTCTACCTTTTTTGCTACCTTATGAGGGATTTGGCTTGAGAGAAATAGGAGAATTTTTAATGGGACGGAGAGGTCTTGTGAGGAGGTTTGGTCCTATGTTAGATTTAATGCTTCCCTTTCGGCATTAGAATTATCCCTTAGGTTTGATTCTCTTGGATTGGAGCCCTTTGTTGTATAGCTGATTAGTTTTTTGTGGTTCCTTTTGTTGAGCTGTTTTTTTGTATGGCTTTTTGTACTCTTTCATTTCTCTCAATGAAAGCCGGTCTTTAATCAAATAATAATAATAATAATAATAATCTAATTCCCTAGAGGATCTTAGTAGTCCACCTATGAATAATAGTTGTTGTTTTAATAAGTATCTCCTAATTCATCTAAGATACTTATTACACCTAAATGTAGATTTAAAAATTACAATTAGGATTTCAAAAAACTAAAGACTTTTCTAAATGAGACTAGTTAAGAAAAAATATCCTCAAACTTTAGCAGAACATGAAAAATAAAATACTATAAATAATAATAAGTATATATAAATTATAGGGCTTGACTATGAAAAATAATCATACTATCTTTAATGAAATGTGCTTTGATGATCCTTTTGGCACCACGTGCAGGTTTGCCAAAGAGCAGGTAGAGCAGCTTGACAGGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGAATGGTAATTATATTTCTCATTTATTGTGGATTGTCTTATTTGTTAGATTGGCAGCCAGGCCTGAAATTTGGCTTTTGTTAAACCAGGTGGGGCTTAACAACATTAAGGAAACAGATTTTGTAAATGTGACAATTCAATCCTTAATGAGGGTTACTCCACTCAGGAACTTCTTCCTAATACCCGAGAACTATCAGCACTGCAAATCTCCACTTGTCCAACGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGGAACTTCAAAGGCCAGGTTTGTCTGTCCTGTCATTGACATTATTCTTGACAGTTGTGGTTGAGTCTTTATTTTTGAAATTCATAGATTTTGAATCCTTAAGAATTCATAGGTAAGCCCGCATGAATTTCTGCAAGCAGTCATGAAGGCTAGTAA
AAAACATTTCCGAATTGGTGCGCAGTCGGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGCGAATTTCAAAGAAAAGTAGCAGTATAATCTACGAATGTTTCCAGGTTCTCTTCAATTTCCTAGAGTAATATTGTGAATTATGTTCTGTTGGTACTAGTGATAAACGAAATTTATGTTTTTTTTTTTTGATAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAACATGGTGACGATCAGGATGCTGGAAGTGAAGGTAGCAGTGTCATTATGGAAACTTCTAGAATGCCATTCTTAATGCTTGGATTGGATTTGCCGCCACCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATACCACAGGTAGGTGTAATTTCCATTGCTGGTGTCTTCAGCGAAGTAGGATACTCATAAAACCACTAGTTCCCAATGTCTTTTGGAAATTTTCATTGGTACTTAATGCGATATTGGATATTTATATTCGGACAATTACAAACATCGACCCTACAACTGATTTGGTATTGAGAGCAAGGTGTAACCAAGATAAGGGAGGAGGCTTTAGAAGAGAACATGGAAAGAACCTGAGGAAGACATCGATATTTTGTTTGTTTCTTGTGCTTGGACTTTCTATTTTGAGTTGGCTTGGGTTCAAGAATCAAAGACCTTTTAATGCTTCTGTTTGGAGGCAATGGTTGTAAGAGCACTCTAAAATTGTTGGTTCGTTCGATAGTTGCTGCTACTTGGGAGATTTGGACAGTGAAAGGCTAAGGGATTTTTGAAAATAAAAGAGGTCTTCCCTTTTTTTCCCCTAAAGAAATGTAGCATATAATTGATTTTGAATTTCTTCTGATCTTCAAACCTTCAAGTTTTGTAATTTTTTATTTTCTAAAAGGATCATGAAAATTGGACAGCTTTGTAATTTCCATATTCCTTTTGTGTAGGGGTTGGGTGCACCCAACCCGTCTCATGTCTTTTAACTTTTTCTTAATAGAAAGGTTCCTTATCTAGGAGGTAATAGCTCGTGGTCAATGAGAAATAATTATGCAATTCTTAGTATATTTAAGGTGATCTAATTTCTGAATGTATGTTTATTTATTTATTTTCTAGGTTCCGCTCTTCAACATTTTGAAGAAATTTGATGGTGAAACTGTCACAGAGGTTGTCCGTCCACATATAGCAAGGATGCGATACCGTGTCACTCGATTGCCACAGTATTTAATTCTTCATATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAGAAGAATCCCACATTAGGTAGTTATTGAGTTGCGTTATATTTGTAAAACGATTACTGCATAATGAAGCCTTCTTAATGTGCCATCCTGAGGTAATCTAATGATTGACATTTTGGTCTCAGTGAACTTTCCTGTCAAGAATCTGGAATTGAAGGATTACATTCCCTTGCCAACACCAAAAGAGAATGAAAAATTGCGTTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGTAAACCCGATGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTAATTTTCTTCTCCCTTTTTGTAAATGGTGCCCAATAATCTAGAATTTGTCTGATCCATTGCAATCTAATATTGAAATTACAGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTTATATGCAGATATATGAACGGCAGCAATAG

mRNA sequence

ATGGGATCAAAGAGGCGAGATGATAATGACGTAGATGAGGAAGAGTTAGCTCCAGTGTTAAAGAGGCAAAAATTACTTGGGGAGTTCTCACCTTCTTCTTCTCCACCTGCCTCAGAGAACACTCGGCTTCCTGGTTTTAACTATGGTGATGATGATGATGAAGAAGATTACAAATTTAAACAAAATGGAAGTAGATATGATAGAGATGAAGGGGATGACAATGATGAGGAGGAAGATGATGAACAGAATGATGATGATGCAAGTCATGTAAAGCGAAGCCGTGATGTTGAAGTTCGGAAAGATTGTCCTTATCTGGATACTGTAAACCGACAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTGAATGTTTATGCCTGCCTCGTATGTGGCAAGTACTATCAAGGGAGGGGGAAGAAATCTCATGCTTACACTCACAGTCTCGAAGCAGGACACCATGTCTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCTGATGGATATGAGATTAATGACCCCTCATTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAGAGCAGCTTGACAGGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGAATGGTGGGGCTTAACAACATTAAGGAAACAGATTTTGTAAATGTGACAATTCAATCCTTAATGAGGGTTACTCCACTCAGGAACTTCTTCCTAATACCCGAGAACTATCAGCACTGCAAATCTCCACTTGTCCAACGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGGAACTTCAAAGGCCAGGTAAGCCCGCATGAATTTCTGCAAGCAGTCATGAAGGCTAGTAAAAAACATTTCCGAATTGGTGCGCAGTCGGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGCGAATTTCAAAGAAAAGTAGCAGTATAATCTACGAATGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAACATGGTGACGATCAGGATGCTGGAAGTGAAGGTAGCAGTGTCATTATGGAAACTTCTAGAATGCCATTCTTAATGCTTGGATTGGATTTGCCGCCACCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATACCACAGGTTCCGCTCTTCAACATTTTGAAGAAATTTGATGGTGAAACTGTCACAGAGGTTGTCCGTCCACATATAGCAAGGATGCGATACCGTGTCACTCGATTGCCACAGTATTTAATTCTTCATATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAGAAGAATCCCACATTAGTGAACTTTCCTGTCAAGAATCTGGAATTGAAGGATTACATTCCCTTGCCAACACCAAAAGAGAATGAAAAATTGCGTTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGTAAACCCGATGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTTATATGCAGATATATGAACGGCAGCAATAG

Coding sequence (CDS)

ATGGGATCAAAGAGGCGAGATGATAATGACGTAGATGAGGAAGAGTTAGCTCCAGTGTTAAAGAGGCAAAAATTACTTGGGGAGTTCTCACCTTCTTCTTCTCCACCTGCCTCAGAGAACACTCGGCTTCCTGGTTTTAACTATGGTGATGATGATGATGAAGAAGATTACAAATTTAAACAAAATGGAAGTAGATATGATAGAGATGAAGGGGATGACAATGATGAGGAGGAAGATGATGAACAGAATGATGATGATGCAAGTCATGTAAAGCGAAGCCGTGATGTTGAAGTTCGGAAAGATTGTCCTTATCTGGATACTGTAAACCGACAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTGAATGTTTATGCCTGCCTCGTATGTGGCAAGTACTATCAAGGGAGGGGGAAGAAATCTCATGCTTACACTCACAGTCTCGAAGCAGGACACCATGTCTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCTGATGGATATGAGATTAATGACCCCTCATTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAGAGCAGCTTGACAGGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGAATGGTGGGGCTTAACAACATTAAGGAAACAGATTTTGTAAATGTGACAATTCAATCCTTAATGAGGGTTACTCCACTCAGGAACTTCTTCCTAATACCCGAGAACTATCAGCACTGCAAATCTCCACTTGTCCAACGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGGAACTTCAAAGGCCAGGTAAGCCCGCATGAATTTCTGCAAGCAGTCATGAAGGCTAGTAAAAAACATTTCCGAATTGGTGCGCAGTCGGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGCGAATTTCAAAGAAAAGTAGCAGTATAATCTACGAATGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAACATGGTGACGATCAGGATGCTGGAAGTGAAGGTAGCAGTGTCATTATGGAAACTTCTAGAATGCCATTCTTAATGCTTGGATTGGATTTGCCGCCACCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATACCACAGGTTCCGCTCTTCAACATTTTGAAGAAATTTGATGGTGAAACTGTCACAGAGGTTGTCCGTCCACATATAGCAAGGATGCGATACCGTGTCACTCGATTGCCACAGTATTTAATTCTTCATATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAGAAGAATCCCACATTAGTGAACTTTCCTGTCAAGAATCTGGAATTGAAGGATTACATTCCCTTGCCAACACCAAAAGAGAATGAAAAATTGCGTTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGTAAACCCGATGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTTATATGCAGATATATGAACGGCAGCAATAG

Protein sequence

MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFKQNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKHFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETVTEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
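
The protein sequence corresponds to the CDS read codon by codon. The sketch below shows that relationship on the first and last few codons of the CDS, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and is not part of the database.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0.

    With to_stop=True, translation ends at the first stop codon
    and the stop itself is not reported.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last five codons of the CDS above.
print(translate("ATGGGATCAAAGAGGCGAGAT"))  # start of the protein
print(translate("GAACGGCAGCAATAG"))        # end of the protein, stop dropped
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, given that the record uses the standard code.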
Homology
BLAST of Sgr020857 vs. NCBI nr
Match: XP_022142503.1 (U4/U6.U5 tri-snRNP-associated protein 2-like [Momordica charantia])

HSP 1 Score: 1047.0 bits (2706), Expect = 5.8e-302
Identity = 525/550 (95.45%), Positives = 533/550 (96.91%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKRRDDN VDEEELAP +KRQKLLGEFSPSS PPASEN RLPGFNYGDDD+EEDYKFK
Sbjct: 1   MGSKRRDDNAVDEEELAPEIKRQKLLGEFSPSSPPPASENPRLPGFNYGDDDEEEDYKFK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGSR   D GDDND+EEDDE  DDDA+HVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSRNGGDGGDDNDDEEDDEY-DDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRF KEQVE LD+NKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFTKEQVELLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIPENYQHCKSPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPENYQHCKSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRMSKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGE +
Sbjct: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGEAI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PKENEKLRSKYDLIANIVHDGKPDEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKENEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 549
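
The percentages in each HSP summary are the match count divided by the alignment length (gaps included). A small Python sketch of that arithmetic, using the counts reported for this hit; the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP summary for XP_022142503.1 above.
identity = percent(525, 550)
positives = percent(533, 550)
print(f"identity={identity:.2f}% positives={positives:.2f}%")
```

The alignment length (550) is one less than the query length (551 residues) here because the subject carries a one-residue gap and the alignment columns are counted rather than the residues of either sequence.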

BLAST of Sgr020857 vs. NCBI nr
Match: XP_038894256.1 (U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida] >XP_038894257.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida])

HSP 1 Score: 1035.4 bits (2676), Expect = 1.8e-298
Identity = 515/550 (93.64%), Positives = 536/550 (97.45%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKRR+++ +DEEEL P LKR KLLGE SP SSPPASEN +LPGFNYGDD++EE+YKFK
Sbjct: 1   MGSKRRNNSLLDEEELGPDLKRHKLLGEVSP-SSPPASENPQLPGFNYGDDEEEEEYKFK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGSRYD DEGDDND+EEDDE++DDDA+HVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSRYDGDEGDDNDDEEDDEEHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQVEQLD+NKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIPENYQHC+SPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           DDQDAG+E SSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET+
Sbjct: 361 DDQDAGTEDSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+NEKL SKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNEKLCSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 549

BLAST of Sgr020857 vs. NCBI nr
Match: XP_038894254.1 (U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida] >XP_038894255.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1035.4 bits (2676), Expect = 1.8e-298
Identity = 515/550 (93.64%), Positives = 536/550 (97.45%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKRR+++ +DEEEL P LKR KLLGE SP SSPPASEN +LPGFNYGDD++EE+YKFK
Sbjct: 47  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSP-SSPPASENPQLPGFNYGDDEEEEEYKFK 106

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGSRYD DEGDDND+EEDDE++DDDA+HVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 107 QNGSRYDGDEGDDNDDEEDDEEHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 166

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 167 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 226

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQVEQLD+NKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 227 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 286

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIPENYQHC+SPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 287 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 346

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKE+G
Sbjct: 347 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENG 406

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           DDQDAG+E SSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET+
Sbjct: 407 DDQDAGTEDSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 466

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 467 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 526

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+NEKL SKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 527 PKDNEKLCSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 586

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 587 AYMQIYERQQ 595

BLAST of Sgr020857 vs. NCBI nr
Match: XP_008463627.1 (PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] >XP_008463628.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] >XP_008463629.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] >XP_008463630.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo])

HSP 1 Score: 1032.3 bits (2668), Expect = 1.5e-297
Identity = 510/550 (92.73%), Positives = 535/550 (97.27%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKRR+++ +DEEEL P LKR KLLGE SPSSSPPASEN +LPGFNYGDDD+EED+KFK
Sbjct: 1   MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS+YD DEGD ND+EEDDE++DD  + VKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSKYDADEGDYNDDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQVEQLD+NKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIPENYQHC+SPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           D+QDAG++GSSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET+
Sbjct: 361 DEQDAGTQGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+N+KLRSKYDLIAN+VHDGKP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNDKLRSKYDLIANVVHDGKPNEGCYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 550

BLAST of Sgr020857 vs. NCBI nr
Match: XP_011655023.1 (U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus] >KGN65053.1 hypothetical protein Csa_013002 [Cucumis sativus])

HSP 1 Score: 1025.8 bits (2651), Expect = 1.4e-295
Identity = 509/550 (92.55%), Positives = 532/550 (96.73%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKRR ++ +DEEEL P LKR KLLGE SPSSSPPASEN +LPGFNYGDDD+EED+KFK
Sbjct: 1   MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS+YD DEGD ND+EEDDE+ D++ + VKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQVEQLD+NKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIP NYQHC+SPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           ++QDAG+EGSSV METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET+
Sbjct: 361 EEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+N+KLRSKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 550

BLAST of Sgr020857 vs. ExPASy Swiss-Prot
Match: Q3TIX9 (U4/U6.U5 tri-snRNP-associated protein 2 OS=Mus musculus OX=10090 GN=Usp39 PE=1 SV=2)

HSP 1 Score: 474.6 bits (1220), Expect = 1.6e-132
Identity = 261/500 (52.20%), Positives = 336/500 (67.20%), Query Frame = 0

Query: 65  RYDRDEGDDNDEEEDDE---QNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFC 124
           R  R+   D D E + E   +N    S  +RSR       CPYLDT+NR VLDFDFEK C
Sbjct: 70  RVKREREADEDSEPEREVRAKNGRVDSEDRRSR------HCPYLDTINRSVLDFDFEKLC 129

Query: 125 SVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDP 184
           S+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD YEI D 
Sbjct: 130 SISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYEIIDS 189

Query: 185 SLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLM 244
           SL+DI YVL P F K+Q+  LD+  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L 
Sbjct: 190 SLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVLQALS 249

Query: 245 RVTPLRNFFLIPENYQHCKSP-------LVQRFGELTRKIWHARNFKGQVSPHEFLQAVM 304
            V PLRN+FL  +NY++ K P       LVQRFGEL RK+W+ RNFK  VSPHE LQAV+
Sbjct: 250 NVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEMLQAVV 309

Query: 305 KASKKHFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSK 364
             SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H  
Sbjct: 310 LCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVNDVFQGSMRIFTKKLPHPD 369

Query: 365 ALTEKKEHGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNI 424
              E+KE     D   E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNI
Sbjct: 370 LPAEEKEQLLHND---EYQETMVEST---FMYLTLDLPTAPLYKDEKEQLIIPQVPLFNI 429

Query: 425 LKKFDGETVTE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKN 484
           L KF+G T  E    + +  + R+++T+LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N
Sbjct: 430 LAKFNGITEKEYKTYKENFLK-RFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITN 489

Query: 485 LELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVS 544
           ++L++Y+       ++   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V+
Sbjct: 490 VDLREYLSEEVQAVHK--NTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVT 549

Query: 545 ETLPQMVALSEAYMQIYERQ 550
           + LPQM+ LSEAY+QI++R+
Sbjct: 550 DILPQMITLSEAYIQIWKRR 554

BLAST of Sgr020857 vs. ExPASy Swiss-Prot
Match: Q53GS9 (U4/U6.U5 tri-snRNP-associated protein 2 OS=Homo sapiens OX=9606 GN=USP39 PE=1 SV=2)

HSP 1 Score: 474.2 bits (1219), Expect = 2.0e-132
Identity = 261/500 (52.20%), Positives = 336/500 (67.20%), Query Frame = 0

Query: 65  RYDRDEGDDNDEEEDDE---QNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFC 124
           R  R+   D D E + E   +N    S  +RSR       CPYLDT+NR VLDFDFEK C
Sbjct: 71  RVKREREVDEDSEPEREVRAKNGRVDSEDRRSR------HCPYLDTINRSVLDFDFEKLC 130

Query: 125 SVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDP 184
           S+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD YEI D 
Sbjct: 131 SISLSHINAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYEIIDS 190

Query: 185 SLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLM 244
           SL+DI YVL P F K+Q+  LD+  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L 
Sbjct: 191 SLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVLQALS 250

Query: 245 RVTPLRNFFLIPENYQHCKSP-------LVQRFGELTRKIWHARNFKGQVSPHEFLQAVM 304
            V PLRN+FL  +NY++ K P       LVQRFGEL RK+W+ RNFK  VSPHE LQAV+
Sbjct: 251 NVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEMLQAVV 310

Query: 305 KASKKHFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSK 364
             SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H  
Sbjct: 311 LCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSMRIFTKKLPHPD 370

Query: 365 ALTEKKEHGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNI 424
              E+KE     D   E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNI
Sbjct: 371 LPAEEKEQLLHND---EYQETMVEST---FMYLTLDLPTAPLYKDEKEQLIIPQVPLFNI 430

Query: 425 LKKFDGETVTE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKN 484
           L KF+G T  E    + +  + R+++T+LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N
Sbjct: 431 LAKFNGITEKEYKTYKENFLK-RFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITN 490

Query: 485 LELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVS 544
           ++L++Y+       ++   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V+
Sbjct: 491 VDLREYLSEEVQAVHK--NTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVT 550

Query: 545 ETLPQMVALSEAYMQIYERQ 550
           + LPQM+ LSEAY+QI++R+
Sbjct: 551 DILPQMITLSEAYIQIWKRR 555

BLAST of Sgr020857 vs. ExPASy Swiss-Prot
Match: Q5R761 (U4/U6.U5 tri-snRNP-associated protein 2 OS=Pongo abelii OX=9601 GN=USP39 PE=2 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 6.0e-132
Identity = 261/500 (52.20%), Positives = 335/500 (67.00%), Query Frame = 0

Query: 65  RYDRDEGDDNDEEEDDE---QNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFC 124
           R  R+   D D E + E   +N    S  +RSR       CPYLDT+NR VLDFDFEK C
Sbjct: 71  RVKREREVDEDSEPEREVRAKNGRVDSEDRRSR------HCPYLDTINRSVLDFDFEKLC 130

Query: 125 SVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDP 184
           S+S S++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD YEI D 
Sbjct: 131 SISPSHVNAYACLVCGKYFQGRGLKSHAYIHSVQFSHHVFLNLHTLKFYCLPDNYEIIDS 190

Query: 185 SLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLM 244
           SL+DI YVL P F K+Q+  LD+  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L 
Sbjct: 191 SLEDITYVLKPTFTKQQIANLDKQAKLSRAYDGTTYLPGIVGLNNIKANDYANAVLQALS 250

Query: 245 RVTPLRNFFLIPENYQHCKSP-------LVQRFGELTRKIWHARNFKGQVSPHEFLQAVM 304
            V PLRN+FL  +NY++ K P       LVQRFGEL RK+W+ RNFK  VSPHE LQAV+
Sbjct: 251 NVPPLRNYFLEEDNYKNIKRPPGDIMFLLVQRFGELMRKLWNPRNFKAHVSPHEMLQAVV 310

Query: 305 KASKKHFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSK 364
             SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H  
Sbjct: 311 LCSKKTFQITKQGDGVDFLSWFLNALHSALGGTKKKKKTIVTDVFQGSMRIFTKKLPHPD 370

Query: 365 ALTEKKEHGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNI 424
              E+KE     D   E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNI
Sbjct: 371 LPAEEKEQLLHND---EYQETMVEST---FMYLTLDLPTAPLYKDEKEQLIIPQVPLFNI 430

Query: 425 LKKFDGETVTE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKN 484
           L KF+G T  E    + +  + R+++T+LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N
Sbjct: 431 LAKFNGITEKEYKTYKENFLK-RFQLTKLPPYLIFCIKRFTKNNFFVEKNPTIVNFPITN 490

Query: 485 LELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVS 544
           ++L++Y+       +E   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V+
Sbjct: 491 VDLREYLSEEVQAVHE--NTTYDLIANIVHDGKPSEGSYRIHVLHHGTGKWYELQDLQVT 550

Query: 545 ETLPQMVALSEAYMQIYERQ 550
           + LPQM+ LSEAY+QI++R+
Sbjct: 551 DILPQMITLSEAYIQIWKRR 555

BLAST of Sgr020857 vs. ExPASy Swiss-Prot
Match: Q9USR2 (Probable mRNA-splicing protein ubp10 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=ubp10 PE=3 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 8.4e-102
Identity = 211/497 (42.45%), Positives = 302/497 (60.76%), Query Frame = 0

Query: 67  DRDEGDDNDEEEDDEQNDD-DASHVKRSRDVEVRKDCP------YLDTVNRQVLDFDFEK 126
           + D   DN + +  E   D +  H   S+++E  +  P      YLDT+NR++LDFDFEK
Sbjct: 16  EEDNNIDNGKRKKLELGKDMEDVHDIASKEMEEHETTPIISQNLYLDTINRKLLDFDFEK 75

Query: 127 FCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIN 186
            CSVSL+NL+VYACLVCG+Y+QGRG  SHAY H+L   HHV++N  T K Y LP+ Y++ 
Sbjct: 76  VCSVSLTNLSVYACLVCGRYFQGRGPSSHAYFHALTENHHVFVNCSTLKFYVLPESYQVE 135

Query: 187 DPSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQS 246
             +L DI YV+ P F K +V++LD   Q S  L    Y+PG VG+NNIK  D+ NV I  
Sbjct: 136 SSALQDIAYVMRPTFTKLEVQRLDHTPQLSYDLMLKPYVPGFVGMNNIKNNDYFNVVIHM 195

Query: 247 LMRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 306
           L  V P RN+FL+ +N+ +C   LVQR   L RK+W+ + FK  VSP E +Q V   S K
Sbjct: 196 LAHVKPFRNYFLL-KNFDNCPQ-LVQRLAILIRKLWNHKAFKSHVSPQELIQEVTVLSHK 255

Query: 307 HFRIGAQSDPVEFMSWFLNTLHSELRISK----KSSSIIYECFQGELEVVKEIHSKALTE 366
            + I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I S+ + +
Sbjct: 256 KYSINEQKDPVEFLSWFLNTLHNCLGGKKSTIAKPTSIVHYSFQGFV----RIESQKIRQ 315

Query: 367 KKEHGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKF 426
             E G  +     G  VI +T+ +PFL L LDLPP P+F+D  E NIIPQV L  IL K+
Sbjct: 316 HAEKG--EQVVFTGDRVI-QTNVVPFLYLTLDLPPKPIFQDEFEGNIIPQVELKEILNKY 375

Query: 427 DGETVTEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDY 486
           +G    E+      R R+ +   P Y I H++RF KNN+F E+N T+V FP+ + ++  +
Sbjct: 376 NGVHTQELAG---MRRRFHLMTAPPYFIFHIKRFMKNNYFTERNQTIVTFPLDDFDMSPF 435

Query: 487 IPLPTPKENEKLRSKYDLIANIVHD----GKPDEGYYRVFVQRKSEELWYEMQDLHVSET 546
           I     + N K+ +KY+L+ANI+H+     + +   +R+ ++  S   WY++QDL+V E 
Sbjct: 436 IDDSFIQSNPKISTKYNLVANIIHESVTHAEEEFHNFRIQIRNPSTNKWYQIQDLYVEEI 495

Query: 547 LPQMVALSEAYMQIYER 549
              M+ L E+++Q++ER
Sbjct: 496 SSDMIRLGESFIQLWER 500

BLAST of Sgr020857 vs. ExPASy Swiss-Prot
Match: P43589 (Pre-mRNA-splicing factor SAD1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=SAD1 PE=1 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 2.3e-43
Identity = 142/500 (28.40%), Positives = 232/500 (46.40%), Query Frame = 0

Query: 70  EGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLN 129
           E D+     +DE   +    +K         +  YL+TV R+ LDFD EK C ++LS LN
Sbjct: 2   EVDNKRRHSEDELKQEAVKKIKSQ-----EPNYAYLETVVREKLDFDSEKICCITLSPLN 61

Query: 130 VYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPS----LDD 189
           VY CLVCG YYQGR +KS A+ HS++  HHV++NL + K Y LP   +I        L+ 
Sbjct: 62  VYCCLVCGHYYQGRHEKSPAFIHSIDENHHVFLNLTSLKFYMLPQNVQILHDGEVQLLNS 121

Query: 190 IRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTP 249
           I++   P +  + +E   R       L    YL G +G  N    D+ +  +  +  + P
Sbjct: 122 IKFAAYPTYCPKDLEDFPRQ---CFDLSNRTYLNGFIGFTNAATYDYAHSVLLLISHMVP 181

Query: 250 LRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKHFRIGA 309
           +R+ FL+  N+   +   ++R     +KIW  + FK  +S  +F+      S    R G 
Sbjct: 182 VRDHFLL--NHFDNQGEFIKRLSICVKKIWSPKLFKHHLSVDDFV------SYLKVREGL 241

Query: 310 QSDPVE---FMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHGDD 369
             +P++   F+ W  N + S    S    SI+    +G++++ K        E K    +
Sbjct: 242 NLNPIDPRLFLLWLFNKICSS---SNDLKSILNHSCKGKVKIAK-------VENKPEASE 301

Query: 370 QDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETVTE 429
              G     VI++    PF +L LDLP    F+D    + +PQ+ +  +L KF       
Sbjct: 302 SVTG----KVIVK----PFWVLTLDLPEFSPFEDGNSVDDLPQINITKLLTKFTKS---- 361

Query: 430 VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK 489
             R       + +TRLPQ+LI H  RF +N+          + PVKN   ++   +    
Sbjct: 362 --RSSSTSTVFELTRLPQFLIFHFNRFDRNS----------DHPVKN---RNQTLVEFSS 421

Query: 490 ENEKLRSKYDLIANIVH--------DGK----PDEGYYRVFVQRKSEELWYEMQDLHVSE 549
           E E L  KY L AN+VH        DG      ++ ++   +     E W E+  ++ +E
Sbjct: 422 ELEILHVKYRLKANVVHVVIKQPSTDGNAFNGDEKSHWITQLYDNKSEKWIEIDGINTTE 448

Query: 550 TLPQMVALSEAYMQIYERQQ 551
              +++ L E ++Q++E+Q+
Sbjct: 482 REAELLFLKETFIQVWEKQE 448

BLAST of Sgr020857 vs. ExPASy TrEMBL
Match: A0A6J1CND9 (U4/U6.U5 tri-snRNP-associated protein 2-like OS=Momordica charantia OX=3673 GN=LOC111012606 PE=4 SV=1)

HSP 1 Score: 1047.0 bits (2706), Expect = 2.8e-302
Identity = 525/550 (95.45%), Positives = 533/550 (96.91%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKRRDDN VDEEELAP +KRQKLLGEFSPSS PPASEN RLPGFNYGDDD+EEDYKFK
Sbjct: 1   MGSKRRDDNAVDEEELAPEIKRQKLLGEFSPSSPPPASENPRLPGFNYGDDDEEEDYKFK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGSR   D GDDND+EEDDE  DDDA+HVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSRNGGDGGDDNDDEEDDEY-DDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRF KEQVE LD+NKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFTKEQVELLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIPENYQHCKSPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPENYQHCKSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRMSKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGE +
Sbjct: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGEAI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PKENEKLRSKYDLIANIVHDGKPDEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKENEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 549

BLAST of Sgr020857 vs. ExPASy TrEMBL
Match: A0A1S3CJP7 (U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucumis melo OX=3656 GN=LOC103501728 PE=4 SV=1)

HSP 1 Score: 1032.3 bits (2668), Expect = 7.2e-298
Identity = 510/550 (92.73%), Positives = 535/550 (97.27%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKRR+++ +DEEEL P LKR KLLGE SPSSSPPASEN +LPGFNYGDDD+EED+KFK
Sbjct: 1   MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS+YD DEGD ND+EEDDE++DD  + VKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSKYDADEGDYNDDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQVEQLD+NKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIPENYQHC+SPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           D+QDAG++GSSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET+
Sbjct: 361 DEQDAGTQGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+N+KLRSKYDLIAN+VHDGKP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNDKLRSKYDLIANVVHDGKPNEGCYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 550

BLAST of Sgr020857 vs. ExPASy TrEMBL
Match: A0A0A0LTF9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G185102 PE=4 SV=1)

HSP 1 Score: 1025.8 bits (2651), Expect = 6.7e-296
Identity = 509/550 (92.55%), Positives = 532/550 (96.73%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKRR ++ +DEEEL P LKR KLLGE SPSSSPPASEN +LPGFNYGDDD+EED+KFK
Sbjct: 1   MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS+YD DEGD ND+EEDDE+ D++ + VKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQVEQLD+NKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIP NYQHC+SPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           ++QDAG+EGSSV METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET+
Sbjct: 361 EEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PK+N+KLRSKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 550

BLAST of Sgr020857 vs. ExPASy TrEMBL
Match: A0A6J1EKT5 (U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111434213 PE=4 SV=1)

HSP 1 Score: 1018.5 bits (2632), Expect = 1.1e-293
Identity = 510/550 (92.73%), Positives = 527/550 (95.82%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKR++D+ VDEEEL P LKR K LGE SP SSPPASEN +LPGFNYGDDD+EEDYK K
Sbjct: 1   MGSKRQNDSVVDEEELGPDLKRHKSLGESSP-SSPPASENPQLPGFNYGDDDEEEDYKSK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS YD DEGD      DDE+ND+D +H+ RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSGYDGDEGDGT----DDEENDEDENHIMRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQVEQLD+ KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIPENYQHC+SPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           DDQDAG+EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET+
Sbjct: 361 DDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PKE+EKLRSKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 545

BLAST of Sgr020857 vs. ExPASy TrEMBL
Match: A0A6J1KIZ3 (U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucurbita maxima OX=3661 GN=LOC111495660 PE=4 SV=1)

HSP 1 Score: 1017.3 bits (2629), Expect = 2.4e-293
Identity = 509/550 (92.55%), Positives = 527/550 (95.82%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           MGSKR++D+ VDEEEL P LKR K LGE SP SSPPASEN +LPGFNYGDDD+EEDYK K
Sbjct: 1   MGSKRQNDSVVDEEELGPDLKRHKSLGELSP-SSPPASENPQLPGFNYGDDDEEEDYKSK 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120
           QNGS YD DEGD      DDE+ND+D +H+ RSRDVEVRKDCPYLDTVNRQVLDFDFEKF
Sbjct: 61  QNGSGYDGDEGD----VTDDEENDEDENHIMRSRDVEVRKDCPYLDTVNRQVLDFDFEKF 120

Query: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180
           CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND
Sbjct: 121 CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIND 180

Query: 181 PSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240
           PSLDDIRYVLNPRFAKEQVEQLD+ KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL
Sbjct: 181 PSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSL 240

Query: 241 MRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKH 300
           MRVTPLRNFFLIP+NYQHC+SPLV RFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK 
Sbjct: 241 MRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR 300

Query: 301 FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKEHG 360
           FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKE+G
Sbjct: 301 FRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENG 360

Query: 361 DDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETV 420
           DDQDAG+EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGET+
Sbjct: 361 DDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI 420

Query: 421 TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480
           TEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT
Sbjct: 421 TEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPT 480

Query: 481 PKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540
           PKE+EKLRSKYDLIANIVHDGKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE
Sbjct: 481 PKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSE 540

Query: 541 AYMQIYERQQ 551
           AYMQIYERQQ
Sbjct: 541 AYMQIYERQQ 545

BLAST of Sgr020857 vs. TAIR 10
Match: AT4G22350.2 (Ubiquitin C-terminal hydrolases superfamily protein )

HSP 1 Score: 760.4 bits (1962), Expect = 1.0e-219
Identity = 403/561 (71.84%), Positives = 453/561 (80.75%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPAS-ENTRLPGFNYGDDDDEEDYKF 60
           M  +R   N V EEE    +KR++++ E S S  PP    N  LP  N  DDD+ +  K 
Sbjct: 1   MKGEREVKNGVSEEERE--VKRKRVM-ERSDSPPPPLGFNNPLLPFANAYDDDNNQQNKS 60

Query: 61  KQNGSRYDRDEGDDND-------EEEDDEQNDDDAS--HVKRSRDVEVRKDCPYLDTVNR 120
           +   +   + EG+ N        E +DDE +DDDAS    K SR VEVR+DCPYLDTVNR
Sbjct: 61  QTRCNVVAKGEGNGNKVKGEAQVEVDDDEDDDDDASKGRGKHSRHVEVRRDCPYLDTVNR 120

Query: 121 QVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVY 180
           QVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVY
Sbjct: 121 QVLDFDFERFCSVSLSNLNVYACLVCGKYFQGRSQKSHAYTHSLEAGHHVYINLLTEKVY 180

Query: 181 CLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKET 240
           CLPD YEINDPSLDDIR+VLNPRF++ QV +LD+N+QWSRALDGSDYLPGMVGLNNI++T
Sbjct: 181 CLPDSYEINDPSLDDIRHVLNPRFSRAQVNELDKNRQWSRALDGSDYLPGMVGLNNIQKT 240

Query: 241 DFVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFL 300
           +FVNVTIQSLMRVTPLRNFFLIPENYQHCKSPL  RFGELTRKIWHARNFKGQVSPHEFL
Sbjct: 241 EFVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLGHRFGELTRKIWHARNFKGQVSPHEFL 300

Query: 301 QAVMKASKKHFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHS 360
           QAVMKASKK FRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE   
Sbjct: 301 QAVMKASKKRFRIGQQSDPVEFMSWLLNTLHMDLRTSKDASSIIHKCFQGELEVVKEYQ- 360

Query: 361 KALTEKKEHGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFN 420
                          G+E      E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF+
Sbjct: 361 ---------------GNENK----EISRMPFLMLGLDLPPPPLFKDVMEKNIIPQVALFD 420

Query: 421 ILKKFDGETVTEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNL 480
           +LKKFDGETVTEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++
Sbjct: 421 LLKKFDGETVTEVVRPKLARMRYRVIKSPRYLMFHMVRFKKNNFFKEKNPTLVNFPVKDM 480

Query: 481 ELKDYIP-LPTPKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVS 540
           EL+DYIP LP   E E + SKY+LIANIVHDGKP++GY+RVFVQRKS+ELWYEMQDLHV+
Sbjct: 481 ELRDYIPSLPRAPEGENVCSKYNLIANIVHDGKPEDGYFRVFVQRKSQELWYEMQDLHVA 538

Query: 541 ETLPQMVALSEAYMQIYERQQ 551
           ETLPQMV LSEAYMQIYE+Q+
Sbjct: 541 ETLPQMVELSEAYMQIYEQQE 538

BLAST of Sgr020857 vs. TAIR 10
Match: AT4G22285.1 (Ubiquitin C-terminal hydrolases superfamily protein )

HSP 1 Score: 759.2 bits (1959), Expect = 2.3e-219
Identity = 402/563 (71.40%), Positives = 453/563 (80.46%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPAS-ENTRLPGFNYGDDDDEED--- 60
           M  +R   N V EEE    +KR++++ E S S  PP    N  LP  N  DDDDEE+   
Sbjct: 1   MKGEREVKNGVSEEERE--VKRKRVM-ERSDSPPPPLGFNNPLLPLANTYDDDDEEEENE 60

Query: 61  -YKFKQNGSRYDRDEGDDN-------DEEEDDEQNDDDASHVKRSRDVEVRKDCPYLDTV 120
             K +  G+   + EG+ N       +E +DDE +D      K SR VEVR+DCPYLDTV
Sbjct: 61  QKKSQARGNGVAKGEGNGNKVKGEAQEEVDDDEDDDVSKGKGKHSRHVEVRRDCPYLDTV 120

Query: 121 NRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEK 180
           NRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEK
Sbjct: 121 NRQVLDFDFERFCSVSLSNLNVYACLVCGKYFQGRSQKSHAYTHSLEAGHHVYINLLTEK 180

Query: 181 VYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIK 240
           VYCLPD YEINDPSLDDIR+VLNPRF++ QV +LD+N+QWSRALDGSDYLPGMVGLNNI+
Sbjct: 181 VYCLPDSYEINDPSLDDIRHVLNPRFSRAQVNELDKNRQWSRALDGSDYLPGMVGLNNIQ 240

Query: 241 ETDFVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHE 300
           +T+FVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLV RFGELTRKIWHARNFKGQVSPHE
Sbjct: 241 KTEFVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLVHRFGELTRKIWHARNFKGQVSPHE 300

Query: 301 FLQAVMKASKKHFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEI 360
           FLQAVMKASKK FRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE 
Sbjct: 301 FLQAVMKASKKRFRIGQQSDPVEFMSWLLNTLHMDLRTSKDASSIIHKCFQGELEVVKEF 360

Query: 361 HSKALTEKKEHGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPL 420
                            G+E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV L
Sbjct: 361 Q----------------GNENK----EISRMSFLMLGLDLPPPPLFKDVMEKNIIPQVAL 420

Query: 421 FNILKKFDGETVTEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVK 480
           F++LKKFDGETVTEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK
Sbjct: 421 FDLLKKFDGETVTEVVRPKLARMRYRVIKSPRYLMFHMVRFKKNNFFKEKNPTLVNFPVK 480

Query: 481 NLELKDYIP-LPTPKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLH 540
           ++EL+DYIP LP   E E + SKY+LIANIVHDGKP++GY+RVFVQRKS+ELWYEMQDLH
Sbjct: 481 DMELRDYIPSLPRAPEGENVCSKYNLIANIVHDGKPEDGYFRVFVQRKSQELWYEMQDLH 540

Query: 541 VSETLPQMVALSEAYMQIYERQQ 551
           V+ETLPQMV LSEAYMQIYE+++
Sbjct: 541 VAETLPQMVELSEAYMQIYEQEE 540

BLAST of Sgr020857 vs. TAIR 10
Match: AT4G22350.1 (Ubiquitin C-terminal hydrolases superfamily protein )

HSP 1 Score: 751.9 bits (1940), Expect = 3.6e-217
Identity = 396/553 (71.61%), Positives = 445/553 (80.47%), Query Frame = 0

Query: 1   MGSKRRDDNDVDEEELAPVLKRQKLLGEFSPSSSPPASENTRLPGFNYGDDDDEEDYKFK 60
           M  +R   N V EEE    +KR++++ E S S  PP                     K +
Sbjct: 1   MKGEREVKNGVSEEERE--VKRKRVM-ERSDSPPPPLVA------------------KGE 60

Query: 61  QNGSRYDRDEGDDNDEEEDDEQNDDDAS--HVKRSRDVEVRKDCPYLDTVNRQVLDFDFE 120
            NG++    +G+   E +DDE +DDDAS    K SR VEVR+DCPYLDTVNRQVLDFDFE
Sbjct: 61  GNGNKV---KGEAQVEVDDDEDDDDDASKGRGKHSRHVEVRRDCPYLDTVNRQVLDFDFE 120

Query: 121 KFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEI 180
           +FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEI
Sbjct: 121 RFCSVSLSNLNVYACLVCGKYFQGRSQKSHAYTHSLEAGHHVYINLLTEKVYCLPDSYEI 180

Query: 181 NDPSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQ 240
           NDPSLDDIR+VLNPRF++ QV +LD+N+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQ
Sbjct: 181 NDPSLDDIRHVLNPRFSRAQVNELDKNRQWSRALDGSDYLPGMVGLNNIQKTEFVNVTIQ 240

Query: 241 SLMRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPHEFLQAVMKASK 300
           SLMRVTPLRNFFLIPENYQHCKSPL  RFGELTRKIWHARNFKGQVSPHEFLQAVMKASK
Sbjct: 241 SLMRVTPLRNFFLIPENYQHCKSPLGHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASK 300

Query: 301 KHFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKE 360
           K FRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE           
Sbjct: 301 KRFRIGQQSDPVEFMSWLLNTLHMDLRTSKDASSIIHKCFQGELEVVKEYQ--------- 360

Query: 361 HGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGE 420
                  G+E      E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGE
Sbjct: 361 -------GNENK----EISRMPFLMLGLDLPPPPLFKDVMEKNIIPQVALFDLLKKFDGE 420

Query: 421 TVTEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP- 480
           TVTEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP 
Sbjct: 421 TVTEVVRPKLARMRYRVIKSPRYLMFHMVRFKKNNFFKEKNPTLVNFPVKDMELRDYIPS 480

Query: 481 LPTPKENEKLRSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVA 540
           LP   E E + SKY+LIANIVHDGKP++GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV 
Sbjct: 481 LPRAPEGENVCSKYNLIANIVHDGKPEDGYFRVFVQRKSQELWYEMQDLHVAETLPQMVE 509

Query: 541 LSEAYMQIYERQQ 551
           LSEAYMQIYE+Q+
Sbjct: 541 LSEAYMQIYEQQE 509

BLAST of Sgr020857 vs. TAIR 10
Match: AT4G22410.1 (Ubiquitin C-terminal hydrolases superfamily protein )

HSP 1 Score: 567.8 bits (1462), Expect = 9.7e-162
Identity = 281/355 (79.15%), Positives = 304/355 (85.63%), Query Frame = 0

Query: 108 VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE 167
           V  QVLDF FE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TE
Sbjct: 2   VEFQVLDFHFERFCSVSLSNLNVYACLVCGKYFQGRSQKSHAYTHSLEAGHHVYINLLTE 61

Query: 168 KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDRNKQWSRALDGSDYLPGMVGLNNI 227
           KVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LD+N+QWSRALDGSDYLPGMVGLNNI
Sbjct: 62  KVYCLPDSYEINDPSLDDIRHVLNPRFSRAQVNELDKNRQWSRALDGSDYLPGMVGLNNI 121

Query: 228 KETDFVNVTIQSLMRVTPLRNFFLIPENYQHCKSPLVQRFGELTRKIWHARNFKGQVSPH 287
           ++T+FVNVTIQSLMRVTPLRNFF IPENYQHCKSPLV  FGELTRKIWHARNFKGQVSPH
Sbjct: 122 QKTEFVNVTIQSLMRVTPLRNFFHIPENYQHCKSPLVHCFGELTRKIWHARNFKGQVSPH 181

Query: 288 EFLQAVMKASKKHFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKE 347
           EFLQAVMKASKK FRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE
Sbjct: 182 EFLQAVMKASKKRFRIGQQSDPVEFMSWLLNTLHMDLRTSKDASSIIHKCFQGELEVVKE 241

Query: 348 IHSKALTEKKEHGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP 407
                             G+E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Sbjct: 242 FQ----------------GNENK----EISRMSFLMLGLDLPPPPLFKDVMEKNIIPQVA 301

Query: 408 LFNILKKFDGETVTEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL 463
           LF++LKKFDGETVTEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTL
Sbjct: 302 LFDLLKKFDGETVTEVVRPKLARMRYRVIKSPRYLMFHMVRFKKNNFFKEKNPTL 336

BLAST of Sgr020857 vs. TAIR 10
Match: AT4G22420.1 (Ubiquitin-specific protease family C19-related protein )

HSP 1 Score: 53.9 bits (128), Expect = 4.7e-07
Identity = 49/108 (45.37%), Positives = 65/108 (60.19%), Query Frame = 0

Query: 23  QKLLGEFSPSSSPPASENTR-LPGFN-YGDDDDEEDYKFKQNGSRYD---RDEGDDN--- 82
           +K + E S S  PP   N   LP  N Y DDD+EE  + K++ +R +   + EG+ N   
Sbjct: 39  EKRVIERSDSPPPPLGFNNHLLPLANAYDDDDEEEGNELKKSQARRNGVAKGEGNGNKVN 98

Query: 83  ----DEEEDDEQNDDDAS--HVKRSRDVEVRKDCPYLDTVNRQVLDFD 117
               +E +D+E +DDDAS    K SR VEVR+DCPYLDTVNRQV+  D
Sbjct: 99  GEAQEEVDDEEDDDDDASKGRGKHSRHVEVRRDCPYLDTVNRQVIIID 146
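
The two summary lines that head each HSP above can be read programmatically. The following is a minimal Python sketch, not part of the database record: the function name parse_hsp and the returned dictionary keys are illustrative assumptions. It extracts the bit score, raw score and E-value, and recomputes the identity and positives percentages from the reported counts.

```python
import re

# Patterns for the two HSP summary lines as they are printed on this page.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
# "Pos\w*" tolerates spelling variants of the positives label.
HSP_COUNTS = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)")


def parse_hsp(score_line, identity_line):
    """Return bit score, raw score, E-value and recomputed percentages."""
    m = HSP_SCORE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line: %r" % score_line)
    stats = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "expect": float(m.group(3)),
    }
    for label, hits, length in HSP_COUNTS.findall(identity_line):
        key = "identity" if label == "Identity" else "positives"
        # Percentage = matching columns / alignment length, rounded to 2 places.
        stats[key + "_pct"] = round(100 * int(hits) / int(length), 2)
    return stats


if __name__ == "__main__":
    # Values copied from the Momordica charantia (A0A6J1CND9) hit above.
    print(parse_hsp(
        "HSP 1 Score: 1047.0 bits (2706), Expect = 2.8e-302",
        "Identity = 525/550 (95.45%), Positives = 533/550 (96.91%), Query Frame = 0",
    ))
```

Recomputing the percentages from the counts is a simple consistency check on the printed values; the denominator is the alignment length including gaps, which is why the TAIR hits report 561, 563 and 553 columns rather than the 550 residues of the query.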

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022142503.1 | 5.8e-302 | 95.45 | U4/U6.U5 tri-snRNP-associated protein 2-like [Momordica charantia]
XP_038894256.1 | 1.8e-298 | 93.64 | U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida] >XP_...
XP_038894254.1 | 1.8e-298 | 93.64 | U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida] >XP_...
XP_008463627.1 | 1.5e-297 | 92.73 | PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] >XP_00846...
XP_011655023.1 | 1.4e-295 | 92.55 | U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus] >KGN65053.1 hypothetic...

Match Name | E-value | Identity (%) | Description
Q3TIX9 | 1.6e-132 | 52.20 | U4/U6.U5 tri-snRNP-associated protein 2 OS=Mus musculus OX=10090 GN=Usp39 PE=1 S...
Q53GS9 | 2.0e-132 | 52.20 | U4/U6.U5 tri-snRNP-associated protein 2 OS=Homo sapiens OX=9606 GN=USP39 PE=1 SV...
Q5R761 | 6.0e-132 | 52.20 | U4/U6.U5 tri-snRNP-associated protein 2 OS=Pongo abelii OX=9601 GN=USP39 PE=2 SV...
Q9USR2 | 8.4e-102 | 42.45 | Probable mRNA-splicing protein ubp10 OS=Schizosaccharomyces pombe (strain 972 / ...
P43589 | 2.3e-43 | 28.40 | Pre-mRNA-splicing factor SAD1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / ...

Match Name | E-value | Identity (%) | Description
A0A6J1CND9 | 2.8e-302 | 95.45 | U4/U6.U5 tri-snRNP-associated protein 2-like OS=Momordica charantia OX=3673 GN=L...
A0A1S3CJP7 | 7.2e-298 | 92.73 | U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucumis melo OX=3656 GN=LOC10350...
A0A0A0LTF9 | 6.7e-296 | 92.55 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G185102 PE=4 SV=1
A0A6J1EKT5 | 1.1e-293 | 92.73 | U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucurbita moschata OX=3662 GN=LO...
A0A6J1KIZ3 | 2.4e-293 | 92.55 | U4/U6.U5 tri-snRNP-associated protein 2-like OS=Cucurbita maxima OX=3661 GN=LOC1...

Match Name | E-value | Identity (%) | Description
AT4G22350.2 | 1.0e-219 | 71.84 | Ubiquitin C-terminal hydrolases superfamily protein
AT4G22285.1 | 2.3e-219 | 71.40 | Ubiquitin C-terminal hydrolases superfamily protein
AT4G22350.1 | 3.6e-217 | 71.61 | Ubiquitin C-terminal hydrolases superfamily protein
AT4G22410.1 | 9.7e-162 | 79.15 | Ubiquitin C-terminal hydrolases superfamily protein
AT4G22420.1 | 4.7e-07 | 45.37 | Ubiquitin-specific protease family C19-related protein

InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001607 | Zinc finger, UBP-type | SMART | SM00290 | Zf_UBP_1 | coord: 120..169; e-value: 1.6E-18; score: 77.5
IPR001607 | Zinc finger, UBP-type | PFAM | PF02148 | zf-UBP | coord: 121..182; e-value: 4.7E-14; score: 52.5
IPR001607 | Zinc finger, UBP-type | PROSITE | PS50271 | ZF_UBP | coord: 119..180; score: 18.232674
None | No IPR available | GENE3D | 3.90.70.10 | Cysteine proteinases | coord: 191..550; e-value: 2.8E-93; score: 314.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 70..84
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 27..41
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..21
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..92
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 53..69
None | No IPR available | PANTHER | PTHR21646:SF71 | BNAA06G13940D PROTEIN | coord: 79..549
None | No IPR available | PANTHER | PTHR21646 | UBIQUITIN CARBOXYL-TERMINAL HYDROLASE | coord: 79..549
None | No IPR available | SUPERFAMILY | 57850 | RING/U-box | coord: 100..194
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 102..190; e-value: 1.4E-41; score: 142.3
IPR001394 | Peptidase C19, ubiquitin carboxyl-terminal hydrolase | PFAM | PF00443 | UCH | coord: 222..546; e-value: 3.5E-37; score: 128.3
IPR028889 | Ubiquitin specific protease domain | PROSITE | PS50235 | USP_3 | coord: 222..549; score: 35.296654
IPR033809 | USP39 | CDD | cd02669 | Peptidase_C19M | coord: 103..547; e-value: 0.0; score: 681.352
IPR038765 | Papain-like cysteine peptidase superfamily | SUPERFAMILY | 54001 | Cysteine proteinases | coord: 214..548
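
The "coord: start..end" fields in the InterPro rows above give residue ranges on the protein, so domain lengths and overlaps follow by simple arithmetic. The following is a minimal Python sketch under that assumption (1-based, inclusive coordinates); the helper names parse_coord, span_length and overlap are hypothetical and not part of InterPro or of this database.

```python
import re

# Matches the "coord: start..end" field used in the InterPro rows.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(row):
    """Return (start, end) from an InterPro row, or None if absent."""
    m = COORD.search(row)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2))


def span_length(span):
    """Length in residues, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def overlap(a, b):
    """Number of residues shared by two inclusive spans (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


if __name__ == "__main__":
    zf_ubp = parse_coord("PF02148 zf-UBP coord: 121..182")  # Pfam zinc finger
    uch = parse_coord("PF00443 UCH coord: 222..546")        # Pfam UCH domain
    print(span_length(zf_ubp), span_length(uch), overlap(zf_ubp, uch))
```

Under the inclusive-coordinate assumption, the Pfam zf-UBP and UCH hits do not overlap: the zinc finger ends at residue 182 and the UCH domain begins at residue 222.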

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sgr020857.1 | Sgr020857.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016579 protein deubiquitination
biological_process GO:0000245 spliceosomal complex assembly
biological_process GO:0006397 mRNA processing
cellular_component GO:0005681 spliceosomal complex
molecular_function GO:0004843 thiol-dependent deubiquitinase
molecular_function GO:0008270 zinc ion binding
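
The GO rows above have three whitespace-separated fields (category, accession, term name), so they can be grouped by category with a short script. The following is a minimal Python sketch; the function name group_go_terms is an illustrative assumption, and the rows are copied from the table above.

```python
from collections import defaultdict

# GO rows copied from the table above: category, accession, term name.
GO_ROWS = """\
biological_process GO:0016579 protein deubiquitination
biological_process GO:0000245 spliceosomal complex assembly
biological_process GO:0006397 mRNA processing
cellular_component GO:0005681 spliceosomal complex
molecular_function GO:0004843 thiol-dependent deubiquitinase
molecular_function GO:0008270 zinc ion binding
"""


def group_go_terms(text):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        # Split on the first two runs of whitespace; the term name may contain spaces.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category, [accession for accession, _ in terms])
```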