识别出来的相差甚远
qiyuhb 发布于2017-10 浏览:1417 回复:4
0
收藏

我使用的服务是:语音识别

调用的接口是:rest api

参数设置情况:

开发平台和机型:树莓派pi

SDK版本号:

代码或日志截图(上传截图能帮助您更快解决问题):

wav_fp = wave.open(filename,'rb')

nf = wav_fp.getnframes()

f_len = nf * 2

print f_len

audio_data = wav_fp.readframes(nf)

#print audio_data

http_header = [

        'Content-Type: audio/wav; rate=8000',

        'Content-Length: %d' % f_len

]

c=pycurl.Curl()

c.setopt(pycurl.URL,str(srv_url))

c.setopt(c.HTTPHEADER,http_header)

c.setopt(c.POST,1)

c.setopt(c.CONNECTTIMEOUT,30)

c.setopt(c.TIMEOUT,30)

c.setopt(c.WRITEFUNCTION,dump_res)

c.setopt(c.POSTFIELDS,audio_data)

c.setopt(c.POSTFIELDSIZE, f_len)

c.perform()

复现步骤、现象及其他描述:

{"corpus_no":"6474897815491553526","err_msg":"success.","err_no":0,"result":["哈哈,"],"sn":"71181167701507554625"}

文件格式:c.wav: RIFF (little-endian) data, WAVE audio, Microsoft PCM, 8 bit, mono 8000 Hz

收藏
点赞
0
个赞
共4条回复 最后由用户已被禁言回复于2022-04
#13用户已被禁言回复于2022-04

https://b2b.baidu.com/s?q=%E6%96%B0%E4%B8%96%E7%95%8C%E5%AE%98%E7%BD%91%E5%AE%A2%E6%9C%8D%E7%94%B5%E8%AF%9D%E7%9B%B4%E8%90%A5%E7%BD%91TL262%C2%B7%EF%BC%A3%EF%BC%AF%EF%BC%AD%E5%BB%B6%E5%AE%89%E4%BC%81%E4%B8%9A%EF%BC%8F&from=search http://www.882667.com/ip_%E8%80%81%E8%A1%97%E7%9A%87%E5%AE%B6%E5%88%A9%E5%8D%8E%E5%A8%B1%E4%B9%90%E7%BB%8F%E7%90%86%E7%83%AD%E7%BA%BF%E6%B8%B8%E6%88%8F%E7%BD%91%E5%9D%80TL262%C2%B7C%EF%BC%AFM%E5%85%AD%E5%AE%89%E7%94%9F%E6%B4%BB%EF%BC%AA%EF%BC%97.html https://www.iqiyi.com/manhua/search-keyword=%E7%8E%8B%E8%80%85%E8%87%B3%E5%B0%8A%E5%AD%98%E6%AC%BE%E5%8F%96%E6%AC%BE%E3%80%90%E5%AE%98%E7%BD%91%E2%86%92TL262%C2%B7%EF%BC%A3%EF%BC%AF%EF%BC%AD%E3%80%91%E5%94%90%E5%B1%B1%E5%8D%AB%E8%AE%A1%E5%A7%94%EF%BC%BD%EF%BC%BA http://search.114chn.com/SearchResult.aspx?key=%E7%BC%85%E7%94%B8%E8%85%BE%E9%BE%99%E5%A8%B1%E4%B9%90%E5%BC%80%E6%88%B7%E5%AE%98%E7%BD%91TL262%C2%B7C%EF%BC%AFM%E8%81%8A%E5%9F%8E%E8%AE%BF%E8%B0%88%EF%BC%BA%EF%BC%B1&type=1 http://www.kingdee.com/?s=%E5%90%89%E6%9E%97%E9%80%9A%E5%8C%96%E7%9C%9F%E4%BA%BA%E5%B9%B3%E5%8F%B0%E5%AE%98%E6%96%B9%E7%BD%91%E5%9D%80TL262%C2%B7%EF%BC%A3%EF%BC%AF%EF%BC%AD%E9%81%B5%E4%B9%89%E8%AE%BF%E8%B0%88%EF%BC%BF

0
#4chenxiaoyu3回复于2017-10

qiyuhb:

已经解决了,是我音频的rate和type有问题

回复qiyuhb:

0
#3qiyuhb回复于2017-10

已经解决了,是我音频的rate和type有问题

0
#2qiyuhb回复于2017-10

又试了一下SDK,结果是一样的,是我的语音文件格式有问题吗?

from aip import AipSpeech

import json

import sys

filename=sys.argv[1]

cuid="xxxx"

api="xxx"

key="xxx"

aip=AipSpeech(cuid,api,key)

f=open(filename,'rb')


a=aip.asr(f.read(),'wav',8000,{'lan':'zh',})

print a

tex=a['result']

print tex[0].encode('utf-8')



{u'err_no': 0, u'corpus_no': u'6474908218597675149', u'err_msg': u'success.', u'result': [u'\u4e0d\u8fd8\uff0c'], u'sn': u'256617698821507557048'}

不还,



0
TOP
切换版块