"ParseFromString()" and "JsonStringToMessage()" have inconsistent behavior when parsing an unrecognized enum value.

1,976 views
Skip to first unread message

Qian Zhang

unread,
Oct 10, 2017, 3:29:09 AM10/10/17
to Protocol Buffers
Hi,

I am using protobuf-3.3.0, and I found when parsing an unrecognized enum value for an optional enum field, the behaviors of "ParseFromString()" and "JsonStringToMessage()" are different. "ParseFromString()" will succeed and the field's getter method will return the default enum value, but "JsonStringToMessage" will fail with an error:
invalid value "xxx" for type TYPE_ENUM

Is this a bug of protobuf-3.3.0? IMHO, "ParseFromString()" and "JsonStringToMessage()" should have consistent behavior.


Thanks,
Qian

Feng Xiao

unread,
Oct 10, 2017, 5:18:37 PM10/10/17
to Qian Zhang, Protocol Buffers
This is the expected behavior. The issue at core is that protobuf message is only able to hold unknown binary data. It's designed that way with its UnknownFieldSet data structure and you can not achieve the same with other formats.
 


Thanks,
Qian

--
You received this message because you are subscribed to the Google Groups "Protocol Buffers" group.
To unsubscribe from this group and stop receiving emails from it, send an email to protobuf+unsubscribe@googlegroups.com.
To post to this group, send email to prot...@googlegroups.com.
Visit this group at https://groups.google.com/group/protobuf.
For more options, visit https://groups.google.com/d/optout.

Feng Xiao

unread,
Oct 10, 2017, 5:20:10 PM10/10/17
to Qian Zhang, Protocol Buffers
Note that you can use the always_print_enums_as_ints option to work around this problem:

With this option enum value will printed as integers and will be accepted by JsonStringToMessage.

Qian Zhang

unread,
Oct 10, 2017, 11:15:44 PM10/10/17
to Feng Xiao, Protocol Buffers
Note that you can use the always_print_enums_as_ints option to work around this problem:
https://github.com/google/protobuf/blob/master/src/google/protobuf/util/json_util.h#L73
 
With this option enum value will printed as integers and will be accepted by JsonStringToMessage.

"always_print_enums_as_ints" is an option for the method "MessageToJsonString()", but the issue that I am talking about is the method "JsonStringToMessage()", the only option that "JsonStringToMessage()" accepts is "JsonParseOptions.ignore_unknown_fields", but I do not think it can help here.

This is the expected behavior. The issue at core is that protobuf message is only able to hold unknown binary data. It's designed that way with its UnknownFieldSet data structure and you can not achieve the same with other formats.

But that behavior is not desired for us :-( We are implementing a server which can accept both protobuf serialized string and JSON string from clients, and then the server will call "ParseFromString()" and "JsonStringToMessage" to get the protobuf message from the serialized string and JSON string respectively. But now these two methods have different behaviors, "ParseFromString()" can successfully parse a serialized string which contains unrecognized enum value, but "JsonStringToMessage()" will fail to do that. This makes our server can not behave consistently for protobuf serialized string and JSON string.

Any suggestions? Thanks!


Regards,
Qian Zhang

Qian Zhang

unread,
Oct 17, 2017, 5:13:51 AM10/17/17
to Protocol Buffers
Feng, any comments? :-)

Feng Xiao

unread,
Oct 17, 2017, 2:52:06 PM10/17/17
to Qian Zhang, Protocol Buffers
On Tue, Oct 10, 2017 at 8:15 PM, Qian Zhang <zhq5...@gmail.com> wrote:
Note that you can use the always_print_enums_as_ints option to work around this problem:
https://github.com/google/protobuf/blob/master/src/google/protobuf/util/json_util.h#L73
 
With this option enum value will printed as integers and will be accepted by JsonStringToMessage.

"always_print_enums_as_ints" is an option for the method "MessageToJsonString()", but the issue that I am talking about is the method "JsonStringToMessage()", the only option that "JsonStringToMessage()" accepts is "JsonParseOptions.ignore_unknown_fields", but I do not think it can help here.

This is the expected behavior. The issue at core is that protobuf message is only able to hold unknown binary data. It's designed that way with its UnknownFieldSet data structure and you can not achieve the same with other formats.

But that behavior is not desired for us :-( We are implementing a server which can accept both protobuf serialized string and JSON string from clients, and then the server will call "ParseFromString()" and "JsonStringToMessage" to get the protobuf message from the serialized string and JSON string respectively. But now these two methods have different behaviors, "ParseFromString()" can successfully parse a serialized string which contains unrecognized enum value, but "JsonStringToMessage()" will fail to do that. This makes our server can not behave consistently for protobuf serialized string and JSON string.

Any suggestions? Thanks!
There really isn't a good way to solve this. The server has to know the enum definition to be able to accept it in JSON format. You will either need to make sure the server is updated with the new definition before sending it JSON data using the new enum names, or use integer instead of enum names in the request.
Reply all
Reply to author
Forward
0 new messages