python使用百度語音識別API注意事項

大森林home發表於2017-11-08

程式碼如下：

# -*- coding:utf-8 -*-
#http://blog.csdn.net/happen23/article/details/45821697
#百度語音識別API的使用樣例（python實現）
#encoding=utf-8



import wave
import urllib, urllib2, pycurl
import base64
import json
## get access token by api key & secret key

def get_token():
    apiKey = "xjlSpsvUgGF8a9ltNOtREoTr"
    secretKey = "a95ca71b81854b526e7eb04ae8f51d23"

    auth_url = "https://openapi.baidu.com/oauth/2.0/token?grant_type=client_credentials&client_id=" + apiKey + "&client_secret=" + secretKey;

    res = urllib2.urlopen(auth_url)
    json_data = res.read()
    return json.loads(json_data)['access_token']

def dump_res(buf):
    print buf


## post audio to server
def use_cloud(token):
    fp = wave.open('8k.wav', 'rb')
    nf = fp.getnframes()
    f_len = nf * 2
    audio_data = fp.readframes(nf)

    cuid = "xxxxxxxxxx" #my xiaomi phone MAC
    srv_url = 'http://vop.baidu.com/server_api' + '?cuid=' + cuid + '&token=' + token
    http_header = [
        'Content-Type: audio/pcm; rate=8000',
        'Content-Length: %d' % f_len
    ]

    c = pycurl.Curl()
    c.setopt(pycurl.URL, str(srv_url)) #curl doesn't support unicode
    #c.setopt(c.RETURNTRANSFER, 1)
    c.setopt(c.HTTPHEADER, http_header)   #must be list, not dict
    c.setopt(c.POST, 1)
    c.setopt(c.CONNECTTIMEOUT, 30)
    c.setopt(c.TIMEOUT, 30)
    c.setopt(c.WRITEFUNCTION, dump_res)
    c.setopt(c.POSTFIELDS, audio_data)
    c.setopt(c.POSTFIELDSIZE, f_len)
    c.perform() #pycurl.perform() has no return val

if __name__ == "__main__":
    token = get_token()
    use_cloud(token)

以上程式碼是不能直接跑的，一般錄音得到的音訊檔案，該程式碼下執行不了，目前只知道百度提供的音訊檔案可以識別，百度提供的音訊檔案下載連結如下：

http://yuyin.baidu.com/docs/asr/54

在上面這個連結的頁面中，往下拖，可以得到下載連結如下：
http://speech-doc.gz.bcebos.com/rest-api-asr/public_audio/public.zip

然後執行結果：

{"corpus_no":"6485972281376050071","err_msg":"success.","err_no":0,"result":["北京科技館，"],"sn":"651708407021510133099"}

注意事項：

百度的語音識別和語音合成用的是相同的

appid、API key和Secret Key，所以使用相同的token即可

獲取以上三個欄位的教程：

http://jingyan.baidu.com/article/f3e34a12df0cddf5eb65359f.html

後記:

下面的程式碼可以執行任意自己錄製的音訊檔案，注意，執行前必須apt-get install ffmpeg

另外，rate改成了16000，不然會識別不準，不過，也沒有經過大量測試，不知道識別準確的情況還會不會出現。

# -*- coding:utf-8 -*-
#http://blog.csdn.net/happen23/article/details/45821697
#百度語音識別API的使用樣例（python實現）
#encoding=utf-8



import wave
import urllib, urllib2, pycurl
import base64
import subprocess
import json
## get access token by api key & secret key

def get_token():
    apiKey = "xjlSpsvUgGF8a9ltNOtREoTr"
    secretKey = "a95ca71b81854b526e7eb04ae8f51d23"

    auth_url = "https://openapi.baidu.com/oauth/2.0/token?grant_type=client_credentials&client_id=" + apiKey + "&client_secret=" + secretKey;

    res = urllib2.urlopen(auth_url)
    json_data = res.read()
    return json.loads(json_data)['access_token']

def dump_res(buf):
    print buf


## post audio to server
def use_cloud(token):
    subprocess.call(['ffmpeg', '-i', 'tian.mp3', 'tian.wav'])#這句程式碼的意思是在終端中執行[]中的命令。所以執行的前提是apt-get install ffmpeg
    fp = wave.open('tian.wav', 'rb')
    nf = fp.getnframes()
    f_len = nf * 2
    audio_data = fp.readframes(nf)

    cuid = "xxxxxxxxxx" #my xiaomi phone MAC
    srv_url = 'http://vop.baidu.com/server_api' + '?cuid=' + cuid + '&token=' + token
    http_header = [
        'Content-Type: audio/pcm; rate=16000',
        'Content-Length: %d' % f_len
    ]

    c = pycurl.Curl()
    c.setopt(pycurl.URL, str(srv_url)) #curl doesn't support unicode
    #c.setopt(c.RETURNTRANSFER, 1)
    c.setopt(c.HTTPHEADER, http_header)   #must be list, not dict
    c.setopt(c.POST, 1)
    c.setopt(c.CONNECTTIMEOUT, 30)
    c.setopt(c.TIMEOUT, 30)
    c.setopt(c.WRITEFUNCTION, dump_res)
    c.setopt(c.POSTFIELDS, audio_data)
    c.setopt(c.POSTFIELDSIZE, f_len)
    c.perform() #pycurl.perform() has no return val

if __name__ == "__main__":
    token = get_token()
    use_cloud(token)

所謂的離線資源下載，其實仍然是本地向伺服器請求，效率上是無法提高的。

語音識別中使用Cool Edit Pro的使用注意事項
2017-01-03
百度API---語音識別
2020-12-19
API
Python 百度語音識別與合成REST API及ffmpeg使用
2017-05-31
PythonRESTAPI
用python呼叫百度語音識別api批量處理本地語音檔案
2020-11-08
PythonAPI
百度語音識別cordova外掛
2018-02-01
安裝百度語音識別sdk
2017-11-08
【JAVA】使用百度語音識別 Rest API，遇到識別結果顯示亂碼的問題和解決
2020-12-18
JavaRESTAPI
.Net Core使用HttpClient請求Web API注意事項
2018-07-18
HTTPclientWebAPI
C 語言位域使用及其注意事項
2016-12-18
使用parallel注意事項
2016-11-07
Parallel
OCR身份證識別軟體拍攝注意事項
2019-10-31
Python Enum 使用的幾點注意事項
2022-02-22
Python
SQL 語句的注意事項
2022-11-24
SQL
Python語音識別終極指南
2018-04-11
Python
ASR-使用whisper語音識別
2024-10-23
Python——常見注意事項
2024-03-23
Python
使用Google Fonts注意事項
2021-10-24
Go
Go 切片使用注意事項
2018-01-27
Go
使用CocosBuilder注意事項
2012-12-28
UI
removeChild使用時注意事項
2009-05-06
REM
Oracle使用*的注意事項
2024-03-06
Oracle
MySQL常用語句及注意事項
2017-04-30
MySql
[譯] 使用 WFST 進行語音識別
2019-05-12
MySQL型別轉換注意事項
2014-03-07
MySql型別
Go語言中 defer 使用場景及注意事項，你是要注意的！
2022-01-19
Go
【知識分享】windows伺服器使用有哪些注意事項
2023-03-25
Windows伺服器
TCP使用注意事項總結
2019-06-27
TCP
C中memcpy使用注意事項
2018-10-26
memcpy
萬兆網路卡使用注意事項
2022-08-26
MySQL半同步使用注意事項
2022-07-07
MySql
Guava HashMultimap使用及注意事項
2022-05-26
Guava
setbuf函式使用注意事項
2016-07-10
函式
php getallheaders使用注意事項
2014-06-18
PHPHeader
使用直方圖注意事項
2015-02-25
直方圖
python語音識別入門及實踐
2018-07-16
Python
【Android】 Android使用Java 8 語言功能注意事項
2019-02-25
AndroidJava
關於COMMIT和ROLLBACK語句的使用注意事項
2007-08-30
MIT
jQuery 語法總結和注意事項
2016-06-17
jQuery

python使用百度語音識別API注意事項

相關文章