2017-03-02 105 views
1

我想反序列化从ZeroMQ收到的谷歌protobuf消息,并试图转换为JSON格式,使用下面的一段代码。但在最终输出中,定义为字节的字段是不可读的。如何反序列化从Google protobuf以Java接收的字节?

(例如,"source_id": "\u0000PV\uff98t\uff9e")。

由于它是机器生成的数据,我们没有从源发送的实际值。

InputStream is = new ByteArrayInputStream(message.getBytes()); 
Schema.nb_event data = Schema.nb_event.parseFrom(is); 
String jsonFormat = JsonFormat.printToString(data); 

输出

{ "seq": 6479250, "timestamp": 1488461706,"op": "OP_UPDATE","topic_seq": 595736,"source_id": "\u0000PV\uff98t\uff9e","location": {"sta_eth_mac": {"addr": "xxxxxxx"},"sta_location_x": 879.11456,"sta_location_y": 945.0676,"error_level": 1220,"associated": true,"campus_id": "\uff9f\uff94\uffc7\uffa3\uffa2\b6\uffe3\uff92U\uff9f\uffdcN\'MT","building_id": "\uffee\u0016??X}5\u001a\uffaa\uffc4^\uffa0n\uffa4\ufffb\'","floor_id": "\uffd9/\"uF\uffdd3\uffdd\uff96\u0015\uff83~\u0005\uff8a(\uffd0","hashed_sta_eth_mac": "\u0013h\u0017\uffd0\uffef\uffc8\u001f\u0005V\u0010w?xxxxxx","loc_algorithm": "ALGORITHM_LOW_DENSITY","unit": "FEET"}} 

==

{ "seq":   6479250, 
    "timestamp": 1488461706, 
    "op":     "OP_UPDATE", 
    "topic_seq":  595736, 
    "source_id":   "\u0000PV\uff98t\uff9e", 
    "location":   { "sta_eth_mac":   { "addr": "\uffc0\uffcc\ufff8P\uffee." }, 
         "sta_location_x": 879.11456, 
         "sta_location_y": 945.0676, 
         "error_level":  1220, 
         "associated":   true, 
         "campus_id":   "\uff9f\uff94\uffc7\uffa3\uffa2\b6\uffe3\uff92U\uff9f\uffdcN\'MT", 
         "building_id":   "\uffee\u0016??X}5\u001a\uffaa\uffc4^\uffa0n\uffa4\ufffb\'", 
         "floor_id":    "\uffd9/\"uF\uffdd3\uffdd\uff96\u0015\uff83~\u0005\uff8a(\uffd0", 
         "hashed_sta_eth_mac": "\u0013h\u0017\uffd0\uffef\uffc8\u001f\u0005V\u0010w?\uff88\uffa8\uffee\u000fm.\u0015\uffe9", 
         "loc_algorithm":  "ALGORITHM_LOW_DENSITY", 
         "unit":     "FEET" 
         } 
    } 

所有不可读字段定义在.proto文件字节。
获取这些值是否需要额外的步骤?

optional bytes building_id  = 10; 
    optional bytes floor_id   = 11; 
    optional bytes hashed_sta_eth_mac = 12; 
+0

嗯,是不透明的二进制数据。你为什么期望能够将它看作文本?基本上在Java中,他们将是一个'byte []'。无可否认,我希望JSON格式化程序可以将它们转换为base64格式,而不是你所显示的格式,但它仍然不会是可读的文本。 –

+0

事实上,正常的'JsonFormat'确实应该将这些字节序列化为Base64:https://github.com/google/protobuf/blob/34a1b6e6b8c0d477504d09df4df4b86770e47872/java/util/src/main/java/com/google/protobuf/util /JsonFormat.java#L992是你正在使用的'JsonFormat'类,还是这是别的? (另一方面,'mesage'是一个'String'还是其他的东西?如果是这样,调用'getBytes()'就是个坏主意)。基本上,[mcve]会让你更容易帮助你。 –

+0

需求是反序列化数据并在我们的数据湖中以JSON格式写成纯文本格式。是否有工作来获得实际价值? –

回答

0

的JSON格式com.googlecode.protobuf.format.JsonFormat返航为字节,但我能获得需要的格式的base64字符串改变JSON格式来com.google.protobuf.util.JsonFormat后。

相关问题