Moc03g20850 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc03g20850
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr3: 14240603 .. 14241349 (+)
RNA-Seq ExpressionMoc03g20850
SyntenyMoc03g20850
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGAGTAATCTTGCAATCAGCAATGAAACCCCAAGTGGACAGCAAATACACGCAGCAAATGGAGCAGGTTTGTCCATAACCCATACTGAACAATCCTCTTTTATCTCACCATCTAACCATGTTTTCCATCTTAAAAATCTTCTCCATGTTCCTTCAGTAACCAAAAACCTTCTAACTGTAAGTCAGTTTGCACATGATAACAATATTTATTTTGAATTTCACCCATCTTTTTGTCTTGTTAAGGACCAAGCTACTGGTAACATCCTTCTCAAAGGGTTACTCCAAGATGGACTTTATTCCTTCAAGCTGAATGCATCTACATCATCATCTCAAAGTCAATCCTCAGTATATCATGTCTCTAATCCGAGCTCTTCTTCTTCAACAACAAATTCATCTCTTCATTGTAAATCAAATCTTGATATTTGGCATAATAGACTAGGACATCCTTCTACCCTGGTTATTACTCAAGTTCTAAAACAATATACTATTTCCTCTCAATTCAATAAAAATCTTTCATTCTGTCAAGCTTGTGCGATAGAAAAGAATCATTCCCTTCCTTTTCCCACCTCCTCCACTACATATACAATACCTCTACAACTTATAGTTTCTGATCTATGGGGGCCATCTTATAAAACTTCACGAAATGGCTATAAATATTACATATCTTTTATTGATGCTTACTCAAGATTCACCTGGATCTACTTTCTAGAATCCAAGGCACGAGCTTTTTCAGCTTTCATTTAA

mRNA sequence

ATGATGAGTAATCTTGCAATCAGCAATGAAACCCCAAGTGGACAGCAAATACACGCAGCAAATGGAGCAGGTTTGTCCATAACCCATACTGAACAATCCTCTTTTATCTCACCATCTAACCATGTTTTCCATCTTAAAAATCTTCTCCATGTTCCTTCAGTAACCAAAAACCTTCTAACTGTAAGTCAGTTTGCACATGATAACAATATTTATTTTGAATTTCACCCATCTTTTTGTCTTGTTAAGGACCAAGCTACTGGTAACATCCTTCTCAAAGGGTTACTCCAAGATGGACTTTATTCCTTCAAGCTGAATGCATCTACATCATCATCTCAAAGTCAATCCTCAGTATATCATGTCTCTAATCCGAGCTCTTCTTCTTCAACAACAAATTCATCTCTTCATTGTAAATCAAATCTTGATATTTGGCATAATAGACTAGGACATCCTTCTACCCTGGTTATTACTCAAGTTCTAAAACAATATACTATTTCCTCTCAATTCAATAAAAATCTTTCATTCTGTCAAGCTTGTGCGATAGAAAAGAATCATTCCCTTCCTTTTCCCACCTCCTCCACTACATATACAATACCTCTACAACTTATAGTTTCTGATCTATGGGGGCCATCTTATAAAACTTCACGAAATGGCTATAAATATTACATATCTTTTATTGATGCTTACTCAAGATTCACCTGGATCTACTTTCTAGAATCCAAGGCACGAGCTTTTTCAGCTTTCATTTAA

Coding sequence (CDS)

ATGATGAGTAATCTTGCAATCAGCAATGAAACCCCAAGTGGACAGCAAATACACGCAGCAAATGGAGCAGGTTTGTCCATAACCCATACTGAACAATCCTCTTTTATCTCACCATCTAACCATGTTTTCCATCTTAAAAATCTTCTCCATGTTCCTTCAGTAACCAAAAACCTTCTAACTGTAAGTCAGTTTGCACATGATAACAATATTTATTTTGAATTTCACCCATCTTTTTGTCTTGTTAAGGACCAAGCTACTGGTAACATCCTTCTCAAAGGGTTACTCCAAGATGGACTTTATTCCTTCAAGCTGAATGCATCTACATCATCATCTCAAAGTCAATCCTCAGTATATCATGTCTCTAATCCGAGCTCTTCTTCTTCAACAACAAATTCATCTCTTCATTGTAAATCAAATCTTGATATTTGGCATAATAGACTAGGACATCCTTCTACCCTGGTTATTACTCAAGTTCTAAAACAATATACTATTTCCTCTCAATTCAATAAAAATCTTTCATTCTGTCAAGCTTGTGCGATAGAAAAGAATCATTCCCTTCCTTTTCCCACCTCCTCCACTACATATACAATACCTCTACAACTTATAGTTTCTGATCTATGGGGGCCATCTTATAAAACTTCACGAAATGGCTATAAATATTACATATCTTTTATTGATGCTTACTCAAGATTCACCTGGATCTACTTTCTAGAATCCAAGGCACGAGCTTTTTCAGCTTTCATTTAA

Protein sequence

MMSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPSNHVFHLKNLLHVPSVTKNLLTVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSSSQSQSSVYHVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFCQACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIYFLESKARAFSAFI
Homology
BLAST of Moc03g20850 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 236.9 bits (603), Expect = 1.9e-58
Identity = 131/251 (52.19%), Postives = 166/251 (66.14%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPS--NHVFHLKNLLHVPSVTKNLL 61
           +SNL+I +E   G QI+AANG+GL ITH    SF S +     F L NLL VPS+TKNL+
Sbjct: 362 LSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLI 421

Query: 62  TVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSSSQSQSSVYH 121
           +VSQFA DN+++FEFHP+ C VKD  TG +LL+GLL DGLY F +  S           H
Sbjct: 422 SVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKR-------LH 481

Query: 122 VSNPSSSSSTTNSSLHCKSN---LDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFCQ 181
            SN  S++    +++  KSN   LD+WH RLGHP   ++  VL     SS     L+FC+
Sbjct: 482 HSN--SNTKPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCE 541

Query: 182 ACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIYF 241
           ACA+ K+H+LPF  S T YT PLQLI  DLWGP+   S NG++YYISF+DAYSR+TWIYF
Sbjct: 542 ACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYF 601

Query: 242 LESKARAFSAF 248
           L SK+ AF AF
Sbjct: 602 LNSKSDAFLAF 603

BLAST of Moc03g20850 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 236.9 bits (603), Expect = 1.9e-58
Identity = 131/251 (52.19%), Postives = 166/251 (66.14%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPS--NHVFHLKNLLHVPSVTKNLL 61
           +SNL+I +E   G QI+AANG+GL ITH    SF S +     F L NLL VPS+TKNL+
Sbjct: 362 LSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLI 421

Query: 62  TVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSSSQSQSSVYH 121
           +VSQFA DN+++FEFHP+ C VKD  TG +LL+GLL DGLY F +  S           H
Sbjct: 422 SVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKR-------LH 481

Query: 122 VSNPSSSSSTTNSSLHCKSN---LDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFCQ 181
            SN  S++    +++  KSN   LD+WH RLGHP   ++  VL     SS     L+FC+
Sbjct: 482 HSN--SNTKPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCE 541

Query: 182 ACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIYF 241
           ACA+ K+H+LPF  S T YT PLQLI  DLWGP+   S NG++YYISF+DAYSR+TWIYF
Sbjct: 542 ACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYF 601

Query: 242 LESKARAFSAF 248
           L SK+ AF AF
Sbjct: 602 LNSKSDAFLAF 603

BLAST of Moc03g20850 vs. NCBI nr
Match: KZV26181.1 (hypothetical protein F511_06348 [Dorcoceras hygrometricum])

HSP 1 Score: 231.5 bits (589), Expect = 8.0e-57
Identity = 122/253 (48.22%), Postives = 167/253 (66.01%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSF-ISPSNHVFHLKNLLHVPSVTKNLLT 61
           + NL++S+E   G ++   NGAGLSI++  +S+  + PS+  F LKNLLHVP +TKNL++
Sbjct: 286 LGNLSVSSEYTGGSKVQVGNGAGLSISNIGESNLNMFPSSRPFLLKNLLHVPLITKNLIS 345

Query: 62  VSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSS-----SQSQS 121
           VS+FA+DN++YFEFHPSFCLVKD AT  +LL+G L +GLY F L +  S      +  QS
Sbjct: 346 VSKFAYDNHVYFEFHPSFCLVKDPATHVVLLRGTLHNGLYRFNLKSRISGPLHSPACLQS 405

Query: 122 SVYHVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFC 181
           SV  +  P  S          ++ LD WH RLGHPS   + QVL         N N+SFC
Sbjct: 406 SVSPIKVPDQSPLCLP-----QNTLDKWHLRLGHPSIATVKQVLLDCNERISKNDNISFC 465

Query: 182 QACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIY 241
            +C + KNH LPFP S+T ++ P +++ SDLWGP++  SRNG +YYISF+DAY+R+TWIY
Sbjct: 466 SSCQLGKNHLLPFPQSTTNFSAPFEVVYSDLWGPAHIPSRNGSRYYISFVDAYTRYTWIY 525

Query: 242 FLESKARAFSAFI 249
           FL+ K+     FI
Sbjct: 526 FLKLKSEVTQTFI 533

BLAST of Moc03g20850 vs. NCBI nr
Match: RVW60229.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 220.7 bits (561), Expect = 1.4e-53
Identity = 122/263 (46.39%), Postives = 162/263 (61.60%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPS--NHVFHLKNLLHVPSVTKNLL 61
           + NL    E     +IH  NG GL I+H   S F S S  N V  LKN+L VP++ KNLL
Sbjct: 508 LGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLL 567

Query: 62  TVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLN---------ASTSS 121
           +VSQFA DNN+YFEFHP  C VKD++  ++LL+G L  GLY F L+          S S+
Sbjct: 568 SVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSN 627

Query: 122 SQSQ-----SSVYHVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTIS 181
            +++     +S+ H  N S     TNSS H     D+WH RLGHP++ ++TQVL    I 
Sbjct: 628 DKNELTCCNASLVHNDN-SDFPEKTNSSFHV---FDLWHKRLGHPASKIVTQVLNDNKIP 687

Query: 182 SQFNKNLSFCQACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFI 241
                  S C AC + K+H+LPFP S T YT PLQL+VSDLWGP+   S  G+ YY+SF+
Sbjct: 688 FSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFV 747

Query: 242 DAYSRFTWIYFLESKARAFSAFI 249
           DAYSR+TW+YFL++K++   AF+
Sbjct: 748 DAYSRYTWVYFLKTKSQTREAFL 766

BLAST of Moc03g20850 vs. NCBI nr
Match: RVW44519.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 220.7 bits (561), Expect = 1.4e-53
Identity = 122/263 (46.39%), Postives = 162/263 (61.60%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPS--NHVFHLKNLLHVPSVTKNLL 61
           + NL    E     +IH  NG GL I+H   S F S S  N V  LKN+L VP++ KNLL
Sbjct: 372 LGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLL 431

Query: 62  TVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLN---------ASTSS 121
           +VSQFA DNN+YFEFHP  C VKD++  ++LL+G L  GLY F L+          S S+
Sbjct: 432 SVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSN 491

Query: 122 SQSQ-----SSVYHVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTIS 181
            +++     +S+ H  N S     TNSS H     D+WH RLGHP++ ++TQVL    I 
Sbjct: 492 DKNELTCCNASLVHNDN-SDFPEKTNSSFHV---FDLWHKRLGHPASKIVTQVLNDNKIP 551

Query: 182 SQFNKNLSFCQACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFI 241
                  S C AC + K+H+LPFP S T YT PLQL+VSDLWGP+   S  G+ YY+SF+
Sbjct: 552 FSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFV 611

Query: 242 DAYSRFTWIYFLESKARAFSAFI 249
           DAYSR+TW+YFL++K++   AF+
Sbjct: 612 DAYSRYTWVYFLKTKSQTREAFL 630

BLAST of Moc03g20850 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 5.6e-29
Identity = 86/248 (34.68%), Postives = 139/248 (56.05%), Query Frame = 0

Query: 3   SNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPSNHVFHLKNLLHVPSVTKNLLTVS 62
           +NL++      G  +  A+G+ + I+HT  +S +S  +   +L N+L+VP++ KNL++V 
Sbjct: 346 NNLSLHQPYTGGDDVMVADGSTIPISHTGSTS-LSTKSRPLNLHNILYVPNIHKNLISVY 405

Query: 63  QFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSSSQSQSSVYHVSN 122
           +  + N +  EF P+   VKD  TG  LL+G  +D LY + +    +SSQ  S       
Sbjct: 406 RLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPI----ASSQPVSLF----- 465

Query: 123 PSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTIS--SQFNKNLSFCQACAI 182
            S SS  T+SS         WH RLGHP+  ++  V+  Y++S  +  +K LS C  C I
Sbjct: 466 ASPSSKATHSS---------WHARLGHPAPSILNSVISNYSLSVLNPSHKFLS-CSDCLI 525

Query: 183 EKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIYFLESK 242
            K++ +PF  S+   T PL+ I SD+W  S   S + Y+YY+ F+D ++R+TW+Y L+ K
Sbjct: 526 NKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQK 572

Query: 243 ARAFSAFI 249
           ++    FI
Sbjct: 586 SQVKETFI 572

BLAST of Moc03g20850 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 3.6e-28
Identity = 85/248 (34.27%), Postives = 137/248 (55.24%), Query Frame = 0

Query: 3   SNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPSNHVFHLKNLLHVPSVTKNLLTVS 62
           +NL+       G  +  A+G+ + ITHT  +S +  S+    L  +L+VP++ KNL++V 
Sbjct: 325 NNLSFHQPYTGGDDVMIADGSTIPITHTGSAS-LPTSSRSLDLNKVLYVPNIHKNLISVY 384

Query: 63  QFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSSSQSQSSVYHVSN 122
           +  + N +  EF P+   VKD  TG  LL+G  +D LY + +    +SSQ+ S       
Sbjct: 385 RLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPI----ASSQAVSMF----- 444

Query: 123 PSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYT--ISSQFNKNLSFCQACAI 182
            S  S  T+SS         WH+RLGHPS  ++  V+  ++  + +  +K LS C  C I
Sbjct: 445 ASPCSKATHSS---------WHSRLGHPSLAILNSVISNHSLPVLNPSHKLLS-CSDCFI 504

Query: 183 EKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIYFLESK 242
            K+H +PF  S+ T + PL+ I SD+W  S   S + Y+YY+ F+D ++R+TW+Y L+ K
Sbjct: 505 NKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQK 551

Query: 243 ARAFSAFI 249
           ++    FI
Sbjct: 565 SQVKDTFI 551

BLAST of Moc03g20850 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 1.6e-12
Identity = 39/109 (35.78%), Postives = 57/109 (52.29%), Query Frame = 0

Query: 139 NLDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFCQACAIEKNHSLPFPTSSTTYTIP 198
           ++D+WH R+GH S   +  + K+  IS      +  C  C   K H + F TSS      
Sbjct: 421 SVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNI 480

Query: 199 LQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIYFLESKARAFSAF 248
           L L+ SD+ GP    S  G KY+++FID  SR  W+Y L++K + F  F
Sbjct: 481 LDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVF 529

BLAST of Moc03g20850 vs. ExPASy Swiss-Prot
Match: P93293 (Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana OX=3702 GN=AtMg00300 PE=4 SV=1)

HSP 1 Score: 48.5 bits (114), Expect = 1.3e-04
Identity = 27/90 (30.00%), Postives = 43/90 (47.78%), Query Frame = 0

Query: 119 HVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFCQAC 178
           ++   S  +  +N +   K    +WH+RL H S   +  ++K+  + S    +L FC+ C
Sbjct: 48  YILQGSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDC 107

Query: 179 AIEKNHSLPFPTSSTTYTIPLQLIVSDLWG 209
              K H + F T   T   PL  + SDLWG
Sbjct: 108 IYGKTHRVNFSTGQHTTKNPLDYVHSDLWG 137

BLAST of Moc03g20850 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 9.2e-59
Identity = 131/251 (52.19%), Postives = 166/251 (66.14%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPS--NHVFHLKNLLHVPSVTKNLL 61
           +SNL+I +E   G QI+AANG+GL ITH    SF S +     F L NLL VPS+TKNL+
Sbjct: 362 LSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLI 421

Query: 62  TVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSSSQSQSSVYH 121
           +VSQFA DN+++FEFHP+ C VKD  TG +LL+GLL DGLY F +  S           H
Sbjct: 422 SVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKR-------LH 481

Query: 122 VSNPSSSSSTTNSSLHCKSN---LDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFCQ 181
            SN  S++    +++  KSN   LD+WH RLGHP   ++  VL     SS     L+FC+
Sbjct: 482 HSN--SNTKPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCE 541

Query: 182 ACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIYF 241
           ACA+ K+H+LPF  S T YT PLQLI  DLWGP+   S NG++YYISF+DAYSR+TWIYF
Sbjct: 542 ACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYF 601

Query: 242 LESKARAFSAF 248
           L SK+ AF AF
Sbjct: 602 LNSKSDAFLAF 603

BLAST of Moc03g20850 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 9.2e-59
Identity = 131/251 (52.19%), Postives = 166/251 (66.14%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPS--NHVFHLKNLLHVPSVTKNLL 61
           +SNL+I +E   G QI+AANG+GL ITH    SF S +     F L NLL VPS+TKNL+
Sbjct: 362 LSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLI 421

Query: 62  TVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSSSQSQSSVYH 121
           +VSQFA DN+++FEFHP+ C VKD  TG +LL+GLL DGLY F +  S           H
Sbjct: 422 SVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKR-------LH 481

Query: 122 VSNPSSSSSTTNSSLHCKSN---LDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFCQ 181
            SN  S++    +++  KSN   LD+WH RLGHP   ++  VL     SS     L+FC+
Sbjct: 482 HSN--SNTKPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCE 541

Query: 182 ACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIYF 241
           ACA+ K+H+LPF  S T YT PLQLI  DLWGP+   S NG++YYISF+DAYSR+TWIYF
Sbjct: 542 ACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYF 601

Query: 242 LESKARAFSAF 248
           L SK+ AF AF
Sbjct: 602 LNSKSDAFLAF 603

BLAST of Moc03g20850 vs. ExPASy TrEMBL
Match: A0A2Z7AWA7 (Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_06348 PE=4 SV=1)

HSP 1 Score: 231.5 bits (589), Expect = 3.8e-57
Identity = 122/253 (48.22%), Postives = 167/253 (66.01%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSF-ISPSNHVFHLKNLLHVPSVTKNLLT 61
           + NL++S+E   G ++   NGAGLSI++  +S+  + PS+  F LKNLLHVP +TKNL++
Sbjct: 286 LGNLSVSSEYTGGSKVQVGNGAGLSISNIGESNLNMFPSSRPFLLKNLLHVPLITKNLIS 345

Query: 62  VSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLNASTSS-----SQSQS 121
           VS+FA+DN++YFEFHPSFCLVKD AT  +LL+G L +GLY F L +  S      +  QS
Sbjct: 346 VSKFAYDNHVYFEFHPSFCLVKDPATHVVLLRGTLHNGLYRFNLKSRISGPLHSPACLQS 405

Query: 122 SVYHVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFC 181
           SV  +  P  S          ++ LD WH RLGHPS   + QVL         N N+SFC
Sbjct: 406 SVSPIKVPDQSPLCLP-----QNTLDKWHLRLGHPSIATVKQVLLDCNERISKNDNISFC 465

Query: 182 QACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFIDAYSRFTWIY 241
            +C + KNH LPFP S+T ++ P +++ SDLWGP++  SRNG +YYISF+DAY+R+TWIY
Sbjct: 466 SSCQLGKNHLLPFPQSTTNFSAPFEVVYSDLWGPAHIPSRNGSRYYISFVDAYTRYTWIY 525

Query: 242 FLESKARAFSAFI 249
           FL+ K+     FI
Sbjct: 526 FLKLKSEVTQTFI 533

BLAST of Moc03g20850 vs. ExPASy TrEMBL
Match: A0A438FJP6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1134 PE=4 SV=1)

HSP 1 Score: 220.7 bits (561), Expect = 6.8e-54
Identity = 122/263 (46.39%), Postives = 162/263 (61.60%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPS--NHVFHLKNLLHVPSVTKNLL 61
           + NL    E     +IH  NG GL I+H   S F S S  N V  LKN+L VP++ KNLL
Sbjct: 508 LGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLL 567

Query: 62  TVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLN---------ASTSS 121
           +VSQFA DNN+YFEFHP  C VKD++  ++LL+G L  GLY F L+          S S+
Sbjct: 568 SVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSN 627

Query: 122 SQSQ-----SSVYHVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTIS 181
            +++     +S+ H  N S     TNSS H     D+WH RLGHP++ ++TQVL    I 
Sbjct: 628 DKNELTCCNASLVHNDN-SDFPEKTNSSFHV---FDLWHKRLGHPASKIVTQVLNDNKIP 687

Query: 182 SQFNKNLSFCQACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFI 241
                  S C AC + K+H+LPFP S T YT PLQL+VSDLWGP+   S  G+ YY+SF+
Sbjct: 688 FSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFV 747

Query: 242 DAYSRFTWIYFLESKARAFSAFI 249
           DAYSR+TW+YFL++K++   AF+
Sbjct: 748 DAYSRYTWVYFLKTKSQTREAFL 766

BLAST of Moc03g20850 vs. ExPASy TrEMBL
Match: A0A438EA49 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2917 PE=4 SV=1)

HSP 1 Score: 220.7 bits (561), Expect = 6.8e-54
Identity = 122/263 (46.39%), Postives = 162/263 (61.60%), Query Frame = 0

Query: 2   MSNLAISNETPSGQQIHAANGAGLSITHTEQSSFISPS--NHVFHLKNLLHVPSVTKNLL 61
           + NL    E     +IH  NG GL I+H   S F S S  N V  LKN+L VP++ KNLL
Sbjct: 372 LGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLL 431

Query: 62  TVSQFAHDNNIYFEFHPSFCLVKDQATGNILLKGLLQDGLYSFKLN---------ASTSS 121
           +VSQFA DNN+YFEFHP  C VKD++  ++LL+G L  GLY F L+          S S+
Sbjct: 432 SVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSN 491

Query: 122 SQSQ-----SSVYHVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTIS 181
            +++     +S+ H  N S     TNSS H     D+WH RLGHP++ ++TQVL    I 
Sbjct: 492 DKNELTCCNASLVHNDN-SDFPEKTNSSFHV---FDLWHKRLGHPASKIVTQVLNDNKIP 551

Query: 182 SQFNKNLSFCQACAIEKNHSLPFPTSSTTYTIPLQLIVSDLWGPSYKTSRNGYKYYISFI 241
                  S C AC + K+H+LPFP S T YT PLQL+VSDLWGP+   S  G+ YY+SF+
Sbjct: 552 FSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFV 611

Query: 242 DAYSRFTWIYFLESKARAFSAFI 249
           DAYSR+TW+YFL++K++   AF+
Sbjct: 612 DAYSRYTWVYFLKTKSQTREAFL 630

BLAST of Moc03g20850 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 48.5 bits (114), Expect = 8.9e-06
Identity = 27/90 (30.00%), Postives = 43/90 (47.78%), Query Frame = 0

Query: 119 HVSNPSSSSSTTNSSLHCKSNLDIWHNRLGHPSTLVITQVLKQYTISSQFNKNLSFCQAC 178
           ++   S  +  +N +   K    +WH+RL H S   +  ++K+  + S    +L FC+ C
Sbjct: 48  YILQGSVETGESNLAETAKDETRLWHSRLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDC 107

Query: 179 AIEKNHSLPFPTSSTTYTIPLQLIVSDLWG 209
              K H + F T   T   PL  + SDLWG
Sbjct: 108 IYGKTHRVNFSTGQHTTKNPLDYVHSDLWG 137

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0048297.11.9e-5852.19Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
TYK10642.11.9e-5852.19Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KZV26181.18.0e-5748.22hypothetical protein F511_06348 [Dorcoceras hygrometricum][more]
RVW60229.11.4e-5346.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
RVW44519.11.4e-5346.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
Q94HW25.6e-2934.68Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT943.6e-2834.27Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109781.6e-1235.78Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P932931.3e-0430.00Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A5A7U2339.2e-5952.19Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5D3CH979.2e-5952.19Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A2Z7AWA73.8e-5748.22Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472... [more]
A0A438FJP66.8e-5446.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438EA496.8e-5446.39Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
ATMG00300.18.9e-0630.00Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 119..182
e-value: 1.8E-8
score: 34.1
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 190..248
e-value: 9.2E-8
score: 33.6
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 23..240
NoneNo IPR availablePANTHERPTHR11439:SF315RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN PROTEIN-RELATEDcoord: 23..240
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 194..246

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc03g20850.1Moc03g20850.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding