PaddleOcr结果是乱码,这什么情况
收藏
用java将图片base64编码后,post到paddlehub。模型用的chinese_ocr_db_crnn_server。
结果返回是乱码,各位名宿们帮忙看看咋回事
0
收藏
请登录后评论
再转下?
啥意思?
字符集的问题
这个其实不是乱码,结果已经出来了
显示的这个其实是Unicode字符串代码,下一步需要做一个Unicode转中文的实现
看你用的java,不妨参考:https://blog.csdn.net/Yinbin_/article/details/104500403
UNICODE解码后还是乱码
解码前:{"msg":"","results":[{"save_path":"","data":[{"text_box_position":[[376,74],[592,74],[592,197],[376,197]],"confidence":0.9957236647605896,"text":"æ\u0098¥æ\u0099\u0093"},{"text_box_position":[[636,126],[781,128],[780,182],[635,179]],"confidence":0.9970438480377197,"text":"å\u009F浩ç\u0084¶"},{"text_box_position":[[69,260],[959,260],[959,351],[69,351]],"confidence":0.9056652188301086,"text":"æ\u0098¥ç\u009C ä¸\u008Dè§\u0089æ\u0099\u0093ï¼\u008Cå¤\u0084å¤\u0084é\u0097»ç\u009Bé¸\u009F"},{"text_box_position":[[70,359],[966,359],[966,450],[70,450]],"confidence":0.9990629553794861,"text":"å¤\u009Cæ\u009D¥é£\u008Eé\u009B¨å£°ï¼\u008Cè\u008A±è\u0090½ç\u009F¥å¤\u009Aå°\u0091"}]}],"status":"000"}
解码后:
{"msg":"","results":[{"save_path":"","data":[{"text_box_position":[[376,74],[592,74],[592,197],[376,197]],"confidence":0.9957236647605896,"text":"¥"},{"text_box_position":[[636,126],[781,128],[780,182],[635,179]],"confidence":0.9970438480377197,"text":"åæµ©¶"},{"text_box_position":[[69,260],[959,260],[959,351],[69,351]],"confidence":0.9056652188301086,"text":"¥ äèïåå»çé"},{"text_box_position":[[70,359],[966,359],[966,450],[70,450]],"confidence":0.9990629553794861,"text":"å¥é¨å£°ï±½¥åå"}]}],"status":"000"
为什么是UNICODE 这个是在哪里设置的?
PaddleOCR-release-2.3\deploy\cpp_infer\src识别中文时出现乱码_peddleocr识别文字乱码-CSDN博客