본 문서는 음성을 텍스트로 변환해주는 음성인식(이하 STT: Speech to Text) 서비스를 사용하기 위해 설명한 문서입니다.
HTTP 기반의 REST API로 제공되며 인식에 사용할 언어와 음성 데이터를 입력 받고 그에 맞는 인식 결과를 텍스트로 반환합니다.
API에서 제공하는 기능은 각각 아래와 같습니다.
RestAPI 호출시 header에 공통으로 발급된 Client ID, Secret Key를 포함시켜야 합니다.
이중화된 서버 구성으로 초기화 시 시작된 세션을 유지시키며 음성데이터가 전달되어야 결과가 출력됩니다.
따라서 withCredentials Option을 true 값으로 넣어줘야 합니다.
JavaScript Ajax
$.ajax({
url: __url__,
type: 'POST',
dataType: "JSON",
data: __data__,
crossDomain: true,
contentType: "application/json",
beforeSend: function(xhr) {
xhr.setRequestHeader("API-KEY-ID", apiKeyId);
xhr.setRequestHeader("API-KEY", apiKey);
},
xhrFields: {
withCredentials: true
},
success: function(data) {
console.log(data);
},
error: function(request,status,error) {
console.log("code:"+request.status+"\n"+"message:"+request.responseText+"\n"+"error:"+error);
}
});
INPUT (Method=Post, (*)필수)
Parameter | Description | Example | ||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
modelId (*) | 음성 인식 모델 | 0 | ||||||||||||||||||||||||
sttType (*) |
음성 인식 유형
|
0 | ||||||||||||||||||||||||
useEpd (*) |
EPD 사용여부
|
0 | ||||||||||||||||||||||||
codec (*) |
음성 데이터 코덱
|
0 | ||||||||||||||||||||||||
startMargin | 음성이 검출된 시점 기준에서 앞으로 마진 값 | 0.5 | ||||||||||||||||||||||||
endMargin | 음성이 검츨된 시점 기준에서 뒤로 마진 값 | 0.7 | ||||||||||||||||||||||||
speakingIdx |
음성 비교평가 키
|
0 | ||||||||||||||||||||||||
evalSentence | 음성 비교평가, 문장 평가, 단어 검출 텍스트 | '' | ||||||||||||||||||||||||
midResult |
중간 결과값 전달
|
0 | ||||||||||||||||||||||||
pitchResult |
피치(음의 높낮이)값 전달
|
0 | ||||||||||||||||||||||||
ampResult |
진폭 값 전달
|
0 | ||||||||||||||||||||||||
evalResult |
발음평가 값 전달
|
0 | ||||||||||||||||||||||||
keywordList | 키워드 검색 리스트 (`|`로 구분) | 안녕|하세요|반갑습니다 | ||||||||||||||||||||||||
keywordModelId | 키워드 검색 모델 | -1 | ||||||||||||||||||||||||
userKey | 음성활동 사용자 고유번호 | DA264662-8CE3-4F3D-91AE-A2C1F45BD9E3 | ||||||||||||||||||||||||
optionValue | 음성활동 사용자 정보 |
정보(기타 추가 항목 문의) (※ 아래 예제 처럼 String 형태로 전송)
|
Input ex
{
"modelId":"0",
"useEpd":"1",
"codec":"1",
"midResult":"1",
"userKey":"DA264662-8CE3-4F3D-91AE-A2C1F45BD9E3",
"optionValue": "{\"age\":\"6\",\"level\":\"Y2\",\"subject\":\"영어\",\"unit\":\"CLSV01SJ020300040301\",\"answer\":\"도토리\"}"
}
OUTPUT
Parameter | Data Type | Description |
---|---|---|
sttId | String | 생성된 채널 고유 ID 값 |
errorCode | String | 에러 코드 (에러가 아닐 경우 보여주지 않음) |
errorMessage | String | 에러 메시지 (에러가 아닐 경우 보여주지 않음) |
Output ex
{
"sttId" : "X1230xdl123lsdf"
}
실시간 음성 데이터 인식을 위해 채널이 생성되고 생성된 채널 고유 ID 값을 리턴 합니다.
가용 음성인식 서버 채널이 없을 경우 429 – Too Many Request를 리턴 합니다.
prepare 호출 후 일정 시간 이상 음성 데이터 전달 혹은 종료 요청이 없을 경우 Timeout으로 종료 처리됩니다.
INPUT (Method=Post, (*)필수)
Parameter | Description | Example |
---|---|---|
sttId (*) | prepare에서 리턴 받은 채널 고유 ID값 | X1230xdl123lsdf |
data (*) |
Base64 인코딩 된 음성 데이터
|
|
dataIndex (*) | 클라이언트에서 보낸 데이터 인덱스 값 | 1 |
Input ex
{
"sttId":"X1230xdl123lsdf",
"dataIndex":1,
"data":"CgAJAAcABgAIAAcACwAIAAYACgAJAAUABQAHAAoACAAKAA0ACQALAAcABQAFAAMABwAGAAMAAAAEAAQAAgD+////AwADAAAABAAFAAYABwAGAAoABgAEAAIABQAHAAgABAD+/wEAAAAGAAYAAgACAAEAAQAAAP/////8//3/AQABAAIAAgADAAIABwAFAAYACgAKAA0ACwAJAAsADQAOABEAEgASABIAEwASABAADwAOABUAEQAPAA8ADQAQAA8ADgALAAgACAAEAAYABwAEAAEA/f8BAAEA/v/9//n//P/8//n/9v/2//f/+f/3//n/+//2//j/+f/4//v//P/8///////8////AgAAAAQABQABAAMAAgACAAQAAgADAAMABgAFAAcABgADAAgACgAKAAsACgAKAAgABwAKAAcAAwAGAAkACgAMAAsACgAKAAoABAADAAcAAwAHAAYACAAGAAMAAwACAAMAAAACAAAA/v8AAP///P/+/wAAAAAAAP3//v/+////BQAFAAQAAgAFAAkACQAIAAcABQALAAkACAAJAAsACgAOAA0ACgAPAA0ADgANAAwACwAJAAoACAAHAAkABwAEAAMAAwAAAAAAAgABAAAA///8//3/AAD9//r//P/4//r/+//5//v/+v/8//z//f////7/+v///wAA+////wAAAAACAAQABAAFAAMABQAGAAUABQAGAAgABwAHAAkADQAOAAkACwAJAAsADQALAA4ADgAQAA8ACgAPAA0ACgAOAAoACQAKAAYABQAHAAUABAAEAAMABgAJAAYABgAJAAcABgAFAAYACwAIAAUABwAIAAYACAAIAAcAAwAAAAIAAwAHAAcAAAABAAUACAAIAAcABQAGAAYABQAJAAcABwAHAAgACQAJAAsACQALAAwACwALAAsACAAHAAoACQAGAAUABAAEAAYABQAGAAIAAgADAAIABgAEAAcABgADAAcABgAFAAMABwAGAAAAAwAEAAIABgAGAAMAAwAAAAAAAwAEAAUABAAAAAAAAAAAAAAA/f8AAAMAAAD8//r///8BAAAABAAFAAMABgAIAAcACQAIAAUACAAIAAcABwAFAAYAAwAGAAoAAwAIAAcABQAHAAgABwAJAAkABwAHAAUABQAFAAQAAgABAAEAAQADAAMAAAAAAPz//P///wAA/f8BAAEABAAGAAAAAwAEAAQAAwAFAAUAAgAFAAQABgAGAAQAAQD//wQA/P///wEA+//+/wAAAAABAAEA/P/+/wAAAAABAAAAAAD//wAAAAAAAAIAAQAAAAMABQABAAAABQABAP//AgADAAQABQAFAAUAAQABAAMABgAEAAAABAAAAAIAAAACAAkABgADAAYACAAIAAoACgAIAAYACAAIAAUABAABAAQAAQAAAAAAAgACAAMABwAAAAEAAQACAAgAAQAAAAMAAAACAAIA//8DAAYAAgADAAYABgAFAAcACwALAAoABgAHAA0ADQANAAsACQANABAADQAJAAsACwAJAAgACAAIAAoABgACAAYABgAEAAoABQACAAMAAwAGAAcABgAIAAoACQAIAAoABwAJAAwACAAGAAkADAAKAA0AEAALABEADwAKAAwADAAJAAoACAABAAQACQABAAAABQAAAAEAAwAFAAIAAAAHAAYABgAHAAYACQAJAAkACgAJAAwACgAMAA0ACwAKAAoACgAHAAkABgAJAAkACgAHAAMAAAAAAAEABAAAAP3//P/5//j/+f/6//z/+f/5//3////9//3//////wAAAAAAAAEAAwAAAAAABwAEAAUABQAEAAYABwAEAAIABQAAAAAABAABAAMAAgAAAAIAAgADAPz//v/6//r//P/3//f/+f/5//z/+f/6//n//P/6//z//v/+/wEA//8CAAUABwAHAAYABgAHAAQABgAHAAgADAAHAAoABAAFAAYABwAIAAkACQACAAIACgAKAAcABwANAAkABwAIAAcABwAEAAAAAAAAAAAAAAAFAAQAAwD///3////8//7//f/5//v/+v/5//n////8//X//v/2//3/AwACAAUABQAFAAIACAAHAAcADQANABAADgAPABEAEwATABQAFAAQAA4ADwALAAsACgAGAAYABgAFAAQABAACAP7//v8AAP////8AAP///f8AAAEAAwABAAkABwAFAAwACwALAAsAEAAMAAsACgAGAAgADAAIAAsAEgALAA4ADQALAAwACgAFAAMAAgAAAPz/+//7//7/AAAAAAEAAwABAAMAAwAAAAMAAQACAAQACAAHAAoACwALAA8ADQAPABAAEQAOAAwACwALAAsACgAKAAsADQANAAwACwAKAAoADAALAAoACwAFAAUACwAOAAoABwANAAUABAAHAAYACQAHAAcABQACAAAABgD//wQAAAD//////P/3//n/9v/3//r/+v/1//3//f/5//7/AgD5//n///8CAAEAAQD7//r//P/+/wAAAwACAP3///8CAP//AgAAAP3////+/wAAAAAAAAMABgACAAgACgAGAAcABwACAAMABwD+/wAA///9//7//v/+//n/+//5//z////+/wAA+//8//z//f8FAAIAAQABAAgABwAKAA4ADwAYAA4AEAARABIAEwATABQAFQAQAA4AEAARABAAEwAPAAsACAAKAAgACAAKAP3/AwAAAAMABADy/wIA9P8CAAAA+/8GAPr/EQD1/xEA8f8IAP3/AgAOAPH/CAD0/wUABgAGABAAAQAHABAAAAAOAAwABgAMAA0AEAAEAAYACQABAAAAAwD3/wQA+/8CAP///f8GAPz/BAABAAAAAQAAAAgABwAEAAEABwACAAQAAQADAP//AgD6/wsA//8EAAIACAAKAAYADQAEABIAEAATABEAFQAMAAwADwARABAADQAOAAUABQAGAAoAAwAIAAUABQAIAAMACQADAAoAAQAHAAUABwAJAAUACQAHAAQAAQACAAAAAwAAAAYAAgAJAAcADAAHABUAFAAWABgAEwAWABAAGQARAA0AEAALAAcABgAEAAAAAQD7//z/AAD///r/AgD+/wQA/f8BAP3/AgD+/wQABAACAAUA//8HAP//CgD8/wEAAAD+/wQA/f8HAPf/AQAEAAYABgD+/wsAAAADAPr/+f8AAP7/CwAAAAEA/P/0/wEA+P8GAAgACAALAAUAEQAEABAABgAVAAsABwAIAAAABwD//wcAAgAEAAYACwAFAA4ACAAFAAAA//8BAAAAAQD//wMA/P/8//T/AgD0/////P8AAPn/+/////3/BwAAAAgAAAAKAAYADAAFAAEADgABAAEABgAKAAsACgALAAsAAwAAAAQABAAEAAYABgANAAUAAwAMAAMACAAKAAcA/v8IAAsABAAJAAMACAAEAAUACAAGAAMABAADAAAAAwACAAsAAAADAAgAAwAAAAUAAAAAAAAA+P/+//b/+//3//7//P/9//7/AAAGAAAACAADAAsAAwAKAA4ACAAHAAoACgAFAAYAAgADAAEABQAAAAAA+P/9/wIABAD//wsABwAHAAoAAgALAAIAEAAGAAwAAwAFAAoABgAIAAUABQADAAEABAAGAAkA"
},
{
"sttId":"X1230xdl123lsdf",
"dataIndex":2,
"data":"DQAHAAwACgATAAwADQAFAAgACQAEAAkAAAARAAMABQALAAYADgAGAA4ADQAKAAgACwAMAAwAEAAIAA8ABgAGAAoAAwADAAIAAgADAAAABAABAAIABAANAAMABQANAAYADAALAA8ADAALAAoACAAJAAMABwABAAAAAAAAAP3/AAD///z/AQAAAAAABgAFAAcABwAMAAAAAgAAAP//CwAIAAgACQAJAAkABwAKAAQADwAIAAkABwAEAAYAAQADAP3/AAABAP//AQAAAAMAAAACAP///v8AAAAAAAABAAQAAAAFAAkABQAFAAAA/v////7/AQD//wAAAwAAAAEAAwD+/wAA/f8CAAMAAQAJAAAAAgAAAAEA//8AAAAA/f////3/+v/+/wAAAAAEAA4AEQAMAA8ABAALAAsADgAOAAkACwAGAAcABAAGAAcACQACAAUAAQD//////f/+/wEAAwAFAAYAAgAAAP7/AAAAAAAA+f/7//r/+/8BAP///f/5//n//v/9//3/AAD//wAABAAGAAYACwAKAAUAAQD+/wAAAAADAAAA///+/wAABgABAAcAAAAAAAIAAQAGAAUACwAIAAsACQAKAAUACAAHAAQACAADAAUABAAFAAgACwAQABMAFgAYABYAGAATABUAEwAUABIADwASAAcABgAEAAcABAACAAUABAAKAAcACgAHAAkACAAFAAQAAQAFAAAAAAAAAAAABAADAAMAAwACAAUAAAABAAQABQAFAAcACQALAAgABQALAAYABAAGAAAAAQACAAMA/P8AAP3/+v8AAP//+//6//n/+f8AAAoACAAJAAcABQAEAAEABgAFAAUACQAJAAoACgANAAoADwATABIADwAWABMAEgARAA0AEAAMAA4ACAAHAAQABgALAAUABAAFAAYACAAEAAkABwAAAAMAAQD+////AAD9/wAA///6//7//f/+/////f8AAP3/AAADAAAAAAABAAEA///9/wAAAgAFAAQAAQADAAAA/P/9//7/+//3//n/+P/9//3/AAD8//f/+f/z//n//f8AAAIAAwADAAEAAgAEAAUABQAIAAkACQANAAkACwAKAA4AEAAQABIABwAIAAYACgAJAA4AEAAQABAACwALAAMABgADAAAA///9//v////9////AAD8//n//P/5//3/+P/5/wAAAAAFAAMACgABAAMAAQAEAAIA/f8EAAMAAwAGAAIABAABAAMAAAAEAAUABAAFAAUACQAEAAQA///8/wAA/f8FAP///v8BAAAABwAJABEAAwAMAAgABQADAA4AGwALABMADwAPAA4ACwAMAA4ADgAMABMAEQAHAA4ADgASAA8ADgANAA4ADAAJAAsACgATAAQABwAHAAAAAAAAAAMAAAD4//n/+//0//f/+/8BAAAA+//+/wAA9/8BAP3/AAAFAPv/CAADAAQABAAGAA4AAQACAA8A/////wkABwAMAAkADgANAA4ADAALABEADQAKABMAFQAMAA8ABAADAAUABgAJABIAEwAIAAgABwD7/wEADwASAAkAEAAFAAAAEgAIAAQAAgADAAYACgAIAAMADAAFABkAEAAIAAsACAAOAAUABwANAAwACgAFAAkAAgABAA0AAgAMAAUACgAEAAQAAQADAAYABQAMAAIACgAAAP//AQD5//j/+v8BAAYAAwACAP//+v/3//n/7//9/wAA9P8CAPn/AwD9////+f/4//f/8P/6/+P/+P/4/wIAFgAZABUAFAD2/9v/0f/Y/93/IgDv/yUAGgD8/w8ABgAOAPX/9//v/9n/6f8SANv/OgD5ALL/fAD9/6z/6v/H/wkAAwAcADIACAA5ABkAAAAQAAQA5v/i//H/9v8EAPT/FQASAPT/BgD2//7/6P8gAPD//v8EAO7/AgDy/w4A7f8QAA8ACgANAA8ADAAOABkA7v8AAAgA4P8cAPP/FgAGACQALgAPABgA7P8JABYAAgAIABkA9//2/zAA//8fADsAEAAPAAwA+v/z/xMAIADr/yYAGgD7/wcACQDz/xcAEgD3/woAAgDw//X/EgD2/9j/IwAMAP//JwD1/+b/DQDt/wQABwAIAP7/BQAYAPj/EgATAAEAFQD2//n/CAAHABwAFQAeAAUADAD3//P/AAD3////8P8EAOj/CgD0//r/FQD7/wQA+P8LAOf/+P8GAPv/CgAPABMABQAgABMAAAAVAPb/6f8BAAoABQAcAB8AEgArAAAAAAAEAAAABAAAAAkAFwATABsACwAHABQAAAAPAA0A//8AAA4AAAADABIA9f8VAA4ABgAIABYAFwAWABkAEAD//xgABAD0/wgADAABAPr/EADr//r/DwAQAPD/AgAAAOT/DAD+//z//v8BAAMA///1//7///8AABAA/P8DAPf/7P/4/9b/5v/9//n/GAAoAAIABgAAAPT//f/6//b/6P/Z/+L/zv8cAJQAggB2AE0Awf+b/5H/nv9FAKEAmgCcACcAz//J/7P/HwAyACkAVQARAPP/+P/V/xcAKQALADYA7f/h//f/4v8RAAYA8v8AANH/3P/p/8X/8v9HAAQA6v8GAMn/0//9/yAADQDg/wIAzv/E/xEAAAALABsA+P/o/9X/6v8OAA0AHgAUAOz/8P/5/+P/AwD///b/7v/x/+r/5f8LAAQA6v/6//j/6v/o//n/AgD7/wMADQAOAAAAEQAJAAEAEAD4/wgAEwAPAAYABQD//wIADQAkABsA//8EABIADwAdAAgACAAVAAAAEgAfAP//DgAAAPH/9/8BAAgAAAAGABMA/f8MAC0AIAAHAAAA/P8RABsAFQAdAPb/+/8DAAIAEgANABUAFQAFAAAABQAOABMACwD5/woADQAaADQAKwAsADUAGwAhAB8A8v8KABIAHAA6ACUAGwAoAA4AEgAPAAAACwAIABoAEAAGAB4AGQAWAAUABgAKAPP/AQDz//H/CAADAAAADQAOAPj/CgAAAPL/CgAIAA4AEwD6/wUAAAAAAA8A8/8DAAcA/v8gABwABwANAPL/5f/U/+X/AQAAABwACAAEAAIA5f/w/+3/8v/u//L/AADn//f//P/x/wUA+f/u/+z/7//+//j/DAAJAAEADQD7//z//P8BAPn/7P////z/5//5/+//+P8HAPv/4f/m//j/AwAFABEADwD2/+z/AwAIAA0AGAALAAUA+v8EABcAHQAaABMAAAACAAcAKgA5ACIAEQAFAA8AEgAZACsAJAAPAA0ACAAUABcAHQAMAAQABQALABMAHAASAAIA9f/+/wMABAARAAcA9//0//L//f8CAP7/+f/p//P/+v/3/wcAFAABAAMA/f///wQAAAD8//v/8f/z//n/CgD///v/AADz/wEABgD6/wAACAAAAAMAEgAOABwAGQAPABMAAwAKAAsADwANAAAA/v/5/wAA//8HAA4ABQAAAAMA8P/9/wAABQAAAPX/+f/w//T/AgD3//b/CQABAPX/6P/r/+///f8SABAAAAD3/+3/9f8JABQAEQADAAgA/v8DAAoABAD//xcADAD3/xUA///v/w4AGgAWAB0AJQAUAAoAIAAbACcAMQAjABEADAAdACsAOAAoAAkAAwAZABUA"
}
OUTPUT
Parameter | Data Type | Description | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
sttId | String | 입력된 채널 고유 ID 값 | |||||||||
analysisResult | Node |
인식 결과
|
|||||||||
errorCode | String | 에러 코드 (에러가 아닐 경우 보여주지 않음) | |||||||||
errorMessage | String | 에러 메시지 (에러가 아닐 경우 보여주지 않음) |
Output ex
{
"sttId": "X1230xdl123lsdf",
"analysisResult": {
"progressCode": "P02",
"dataIndex": "1",
"midResult": "안"
}
},
{
"sttId": "X1230xdl123lsdf",
"analysisResult": {
"progressCode": "P02",
"dataIndex": "2",
"midResult": "안녕"
}
}
prepare 호출 시 midResult를 선택한 경우에만 음성인식 중간 결과값을 받을 수 있으며 유효한 음성데이터가 없을 경우 중간 결과값이 존재하지 않을 수 있습니다.
인식 결과에서 음성인식 진행 상황(progressCode)를 수신한 후 P03(끝 음 검출)이나 P05(끝 음 검출 시간 초과)를 수신 받을 경우 finish를 호출해야 합니다.
P01(인식 진행 중)이나 P02(첫 음 검출)일 경우는 계속해서 데이터를 보내주시면 됩니다.
INPUT (Method=Post, (*)필수)
Parameter | Description | Example |
---|---|---|
sttId (*) | prepare에서 리턴 받은 채널 고유 ID값 | X1230xdl123lsdf |
Input ex
{
"sttId":"X1230xdl123lsdf"
}
OUTPUT
Parameter | Data Type | Description | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
sttId | String | 입력된 채널 고유 ID 값 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
analysisResult | Node |
인식 결과
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
kwsResult | Node |
키워드 검색 결과
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
evaluationWord | Node |
말하기 평가 결과 (발음, 억양, 강세)
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
evaluationWordSil | Node |
말하기 평가 결과 (리듬&휴지)
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
errorCode | String | 에러 코드 (에러가 아닐 경우 보여주지 않음) | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
errorMessage | String | 에러 메시지 (에러가 아닐 경우 보여주지 않음) |
Output ex
{
"sttId": "Kzyzg7ir4OQBcBWb",
"analysisResult": {
"progressCode": "P03",
"dataIndex": "85",
"result": "I sat down my bed ",
"score": "0.0",
"totalStartTime": "0.19",
"totalEndTime": "3.49",
"syllablePerSec": "11.515",
"wordResult": [
{
"token": "<eps>",
"score1": 4.806,
"score2": 85.899,
"startTime": 0.19,
"endTime": 0.67,
"realAmplitude": 273.0,
"normalAmplitude": 17.449,
"phonemeResult": [
{
"duration": 0.48,
"score": 0.0,
"phoneme": "SIL"
}
],
"pitchesValue": [
101.99032,
101.99032,
101.99032,
101.48291,
101.48291,
101.48291,
101.48291,
101.48291,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
101.99032,
102.500275,
103.01278,
103.01278,
103.01278,
103.01278,
103.01278,
103.01278,
103.01278,
103.01278,
103.01278,
103.01278,
103.52785,
103.52785,
103.52785,
104.04549,
104.56571,
105.08854,
105.613976,
106.142044,
106.67275,
107.74214,
108.28085,
107.74214,
107.206116
]
},
{
"token": "I",
"score1": 3.718,
"score2": 49.089,
"startTime": 0.67,
"endTime": 0.94,
"realAmplitude": 16705.0,
"normalAmplitude": 1179.775,
"phonemeResult": [
{
"duration": 0.27,
"score": 0.0,
"phoneme": "AY_S"
}
],
"pitchesValue": [
107.206116,
105.613976,
107.74214,
109.36636,
108.82226,
117.275856,
114.387436,
115.53418,
107.74214,
101.99032,
100.47564,
98.983444,
98.491,
98.491,
98.491,
98.000984,
98.000984,
98.000984,
98.000984,
98.983444,
99.47836,
99.97575,
99.47836,
98.983444,
98.000984,
96.545555,
95.11173,
94.1677
]
},
{
"token": "sat",
"score1": 3.196,
"score2": 41.949,
"startTime": 0.94,
"endTime": 1.33,
"realAmplitude": 5071.0,
"normalAmplitude": 561.62,
"phonemeResult": [
{
"duration": 0.18,
"score": 0.0,
"phoneme": "S_B"
},
{
"duration": 0.12,
"score": 0.0,
"phoneme": "AE_I"
},
{
"duration": 0.09,
"score": 0.0,
"phoneme": "T_E"
}
],
"pitchesValue": [
94.1677,
94.1677,
95.11173,
96.06523,
97.02828,
97.51342,
98.000984,
98.491,
98.983444,
99.47836,
100.47564,
101.99032,
103.52785,
105.08854,
106.67275,
107.74214,
108.28085,
109.9132,
112.128006,
116.11184,
112.688644,
111.57016,
113.25209,
114.95937,
116.11184,
116.6924,
119.04381,
120.83841,
121.442604,
121.442604,
120.83841,
120.23722,
119.04381,
117.862236,
117.275856,
116.11184,
114.95937,
113.81835,
112.688644,
111.57016
]
},
{
"token": "<eps>",
"score1": 0.0,
"score2": 0.0,
"startTime": 1.33,
"endTime": 1.48,
"realAmplitude": 14.0,
"normalAmplitude": 3.498,
"phonemeResult": [
{
"duration": 0.15,
"score": 0.0,
"phoneme": "SIL"
}
],
"pitchesValue": [
111.57016,
110.46277,
109.36636,
108.28085,
107.206116,
106.142044,
105.613976,
105.08854,
104.56571,
104.04549,
103.52785,
103.01278,
102.500275,
101.99032,
101.48291,
100.97801
]
},
{
"token": "down",
"score1": 10.229,
"score2": 122.727,
"startTime": 1.48,
"endTime": 1.96,
"realAmplitude": 15368.0,
"normalAmplitude": 574.366,
"phonemeResult": [
{
"duration": 0.12,
"score": 0.0,
"phoneme": "D_B"
},
{
"duration": 0.18,
"score": 0.0,
"phoneme": "AW_I"
},
{
"duration": 0.18,
"score": 0.0,
"phoneme": "N_E"
}
],
"pitchesValue": [
100.97801,
100.47564,
99.97575,
99.47836,
98.983444,
98.000984,
98.491,
105.613976,
106.142044,
106.142044,
98.000984,
97.02828,
96.06523,
95.11173,
94.63854,
93.6992,
94.1677,
94.1677,
92.769196,
93.23304,
93.23304,
92.769196,
91.84841,
90.93677,
90.03418,
90.93677,
90.484344,
91.39146,
91.39146,
91.84841,
94.1677,
96.06523,
97.51342,
98.491,
98.983444,
99.47836,
99.97575,
100.47564,
100.97801,
101.48291,
101.99032,
101.99032,
102.500275,
103.52785,
103.52785,
101.99032,
101.48291,
103.01278,
104.04549
]
},
{
"token": "<eps>",
"score1": 0.0,
"score2": 0.0,
"startTime": 1.96,
"endTime": 1.99,
"realAmplitude": 707.0,
"normalAmplitude": 230.072,
"phonemeResult": [
{
"duration": 0.03,
"score": 0.0,
"phoneme": "SIL"
}
],
"pitchesValue": [
104.04549,
104.56571,
106.142044,
107.206116
]
},
{
"token": "<unk>",
"score1": 0.0,
"score2": 0.0,
"startTime": 1.99,
"endTime": 2.23,
"realAmplitude": 4367.0,
"normalAmplitude": 500.093,
"phonemeResult": [
{
"duration": 0.24,
"score": 0.0,
"phoneme": "spn_S"
}
],
"pitchesValue": [
107.206116,
107.206116,
106.142044,
105.613976,
104.56571,
103.52785,
102.500275,
103.01278,
104.56571,
105.08854,
106.142044,
105.613976,
105.613976,
106.67275,
106.67275,
105.613976,
104.04549,
98.491,
95.11173,
94.63854,
94.1677,
93.23304,
93.23304,
93.6992,
94.1677
]
},
{
"token": "my",
"score1": 0.0,
"score2": 0.0,
"startTime": 2.23,
"endTime": 2.47,
"realAmplitude": 5713.0,
"normalAmplitude": 679.121,
"phonemeResult": [
{
"duration": 0.12,
"score": 0.0,
"phoneme": "M_B"
},
{
"duration": 0.12,
"score": 0.0,
"phoneme": "AY_E"
}
],
"pitchesValue": [
94.1677,
95.11173,
95.587296,
96.06523,
96.06523,
96.06523,
95.587296,
95.11173,
94.63854,
94.1677,
95.587296,
95.587296,
95.11173,
94.63854,
94.63854,
94.1677,
93.6992,
94.63854,
94.63854,
95.11173,
95.11173,
97.02828,
98.983444,
100.97801,
103.01278
]
},
{
"token": "bed",
"score1": 1.44,
"score2": 0.0,
"startTime": 2.47,
"endTime": 2.98,
"realAmplitude": 8240.0,
"normalAmplitude": 300.651,
"phonemeResult": [
{
"duration": 0.09,
"score": 0.0,
"phoneme": "B_B"
},
{
"duration": 0.12,
"score": 0.0,
"phoneme": "EH_I"
},
{
"duration": 0.3,
"score": 0.0,
"phoneme": "D_E"
}
],
"pitchesValue": [
103.01278,
105.08854,
107.74214,
110.46277,
109.9132,
107.74214,
104.04549,
101.48291,
98.491,
96.06523,
94.63854,
94.1677,
92.307655,
90.03418,
89.14054,
87.816696,
85.65383,
85.2277,
86.0821,
86.51251,
86.51251,
85.2277,
83.96195,
84.38177,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367
]
},
{
"token": "<eps>",
"score1": 0.0,
"score2": 0.0,
"startTime": 2.98,
"endTime": 3.49,
"realAmplitude": 13.0,
"normalAmplitude": 2.725,
"phonemeResult": [
{
"duration": 0.51,
"score": 0.0,
"phoneme": "SIL"
}
],
"pitchesValue": [
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367
]
}
]
},
"kwsResult": [
],
"evaluationWord": [
{
"token": "I",
"similar": "0.478",
"refPitch": [
204.00536,
208.11617,
212.3098,
216.58794,
220.95229,
225.40457,
231.0963,
227.66423,
232.25179,
235.75302,
241.70609,
245.34984,
247.80946,
250.29376,
251.54523,
252.80295,
254.06696,
252.80295,
252.80295
],
"userPitch": [
107.206116,
106.622055,
109.280876,
112.38167,
114.69148,
112.66342,
102.8985,
100.004425,
98.594666,
98.491,
98.12994,
98.000984,
98.000984,
99.061584,
99.79251,
99.42626,
98.41465,
96.470085,
94.61487
],
"refAmplitude": "3897.902",
"userAmplitude": "1179.775",
"refStart": "0.24",
"userStart": "0.67",
"refEnd": "0.42",
"userEnd": "0.94",
"refScore": "100.0",
"userScore": "100.0"
},
{
"token": "sat",
"similar": "0.96",
"refPitch": [
252.80295,
251.54523,
251.54523,
252.80295,
255.3373,
257.89706,
260.48248,
263.0938,
265.73132,
268.3953,
271.08594,
273.8036,
276.54846,
279.32086,
282.12106,
284.94934,
287.80597,
292.14468,
295.07343,
290.69122,
289.245,
287.80597,
287.80597,
289.245,
292.14468,
296.5488,
302.52438,
310.16348,
319.5855,
326.02536,
330.94022,
335.9292,
339.2969,
335.9292,
329.29376,
322.78937,
316.41345,
310.16348,
304.03702,
298.03156,
292.14468,
284.94934,
277.9312,
271.08594,
265.73132,
260.48248
],
"userPitch": [
94.1677,
94.1677,
94.86546,
95.69212,
96.52582,
97.19702,
97.619415,
98.043594,
98.46969,
98.8978,
99.327736,
100.04204,
101.1342,
102.45826,
103.79927,
105.157425,
106.534996,
107.50967,
108.093475,
109.13251,
110.77986,
113.16727,
115.66533,
112.688644,
111.71605,
112.813324,
114.291306,
115.51056,
116.31377,
117.203575,
119.19986,
120.76038,
121.337524,
121.4426,
121.101105,
120.57703,
119.87401,
118.83832,
117.83674,
117.32684,
116.36488,
115.36023,
114.36406,
113.3763,
112.396866,
111.57016
],
"refAmplitude": "2999.289",
"userAmplitude": "561.62",
"refStart": "0.42",
"userStart": "0.94",
"refEnd": "0.87",
"userEnd": "1.33",
"refScore": "100.0",
"userScore": "59.099"
},
{
"token": "down",
"similar": "0.707",
"refPitch": [
260.48248,
255.3373,
245.34984,
239.30704,
231.0963,
225.40457,
222.05704,
219.85303,
218.75923,
217.67088,
217.67088,
217.67088,
218.75923,
218.75923,
218.75923,
218.75923,
219.85303,
220.95229,
220.95229,
222.05704,
223.16733,
224.28316,
225.40457,
227.66423,
227.66423,
227.66423,
224.28316,
219.85303,
216.58794,
213.37135,
210.20253,
206.05052,
204.00536,
200.97563,
198.98085,
197.99089,
197.00586
],
"userPitch": [
100.97801,
100.31351,
99.65312,
98.99682,
98.14667,
102.91879,
106.1135,
103.94176,
97.42262,
96.14332,
94.99663,
94.10541,
94.11706,
93.86532,
93.01992,
93.23304,
92.59499,
91.380264,
90.18055,
90.8634,
90.925644,
91.39146,
92.16183,
95.039536,
97.2003,
98.544235,
99.197464,
99.85477,
100.51637,
101.1827,
101.85318,
102.01788,
102.889084,
103.52785,
101.9766,
102.02043,
103.71055
],
"refAmplitude": "5189.778",
"userAmplitude": "574.366",
"refStart": "0.87",
"userStart": "1.48",
"refEnd": "1.23",
"userEnd": "1.96",
"refScore": "100.0",
"userScore": "100.0"
},
{
"token": "on",
"similar": "0.529",
"refPitch": [
197.00586,
197.00586,
197.00586,
197.99089,
198.98085,
199.97575,
201.98051,
202.9904,
202.9904,
202.9904,
204.00536,
204.00536,
205.02539,
205.02539,
205.02539,
206.05052
],
"userPitch": [
107.206116,
106.607574,
105.48295,
103.85217,
102.6284,
104.274536,
105.483604,
105.64698,
106.143364,
106.60658,
104.63367,
97.85738,
94.756836,
93.87562,
93.23304,
93.904175
],
"refAmplitude": "6252.867",
"userAmplitude": "500.093",
"refStart": "1.23",
"userStart": "1.99",
"refEnd": "1.38",
"userEnd": "2.23",
"refScore": "100.0",
"userScore": "70.0"
},
{
"token": "my",
"similar": "0.467",
"refPitch": [
206.05052,
207.08076,
208.11617,
207.08076,
206.05052,
204.00536,
202.9904,
201.98051,
201.98051,
202.9904,
204.00536,
205.02539,
205.02539,
204.00536,
201.98051,
199.97575
],
"userPitch": [
94.1677,
95.379234,
96.06523,
96.06523,
95.46841,
94.727264,
94.70005,
95.587296,
94.87514,
94.609116,
93.87489,
94.63854,
94.99344,
95.71065,
98.739044,
101.868225
],
"refAmplitude": "6797.068",
"userAmplitude": "679.121",
"refStart": "1.38",
"userStart": "2.23",
"refEnd": "1.53",
"userEnd": "2.47",
"refScore": "100.0",
"userScore": "100.0"
},
{
"token": "bed",
"similar": "0.848",
"refPitch": [
199.97575,
199.97575,
200.97563,
201.98051,
202.9904,
204.00536,
205.02539,
207.08076,
202.9904,
199.97575,
197.00586,
195.05048,
194.08008,
193.1145,
192.15373,
190.24652,
188.35823,
185.56088,
180.99063,
175.65468,
168.78395,
163.80789,
158.18758,
155.83829,
152.76009,
150.49142,
148.99771,
148.25644,
147.51884,
146.78491,
146.05464,
146.05464,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328,
145.328
],
"userPitch": [
103.01278,
105.43466,
108.45187,
110.24772,
108.78047,
105.331276,
102.03999,
98.75117,
96.0032,
94.556656,
93.6016,
91.31918,
89.52908,
88.219604,
86.02998,
85.24623,
86.11953,
86.51251,
86.06562,
84.622345,
84.21749,
84.69361,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367,
84.80367
],
"refAmplitude": "3708.395",
"userAmplitude": "300.651",
"refStart": "1.53",
"userStart": "2.47",
"refEnd": "1.98",
"userEnd": "2.98",
"refScore": "100.0",
"userScore": "100.0"
}
],
"evaluationWordSil": [
{
"refToken": "I",
"userToken": "I",
"refSilience": "0",
"userSilience": "0",
"refStart": "0.24",
"userStart": "0.67",
"refEnd": "0.42",
"userEnd": "0.94"
},
{
"refToken": "sat",
"userToken": "sat",
"refSilience": "0",
"userSilience": "0",
"refStart": "0.42",
"userStart": "0.94",
"refEnd": "0.87",
"userEnd": "1.33"
},
{
"refToken": "down",
"userToken": "down",
"refSilience": "0",
"userSilience": "0",
"refStart": "0.87",
"userStart": "1.48",
"refEnd": "1.23",
"userEnd": "1.96"
},
{
"refToken": "on",
"userToken": "on",
"refSilience": "0",
"userSilience": "0",
"refStart": "1.23",
"userStart": "1.99",
"refEnd": "1.38",
"userEnd": "2.23"
},
{
"refToken": "my",
"userToken": "my",
"refSilience": "0",
"userSilience": "0",
"refStart": "1.38",
"userStart": "2.23",
"refEnd": "1.53",
"userEnd": "2.47"
},
{
"refToken": "bed",
"userToken": "bed",
"refSilience": "0",
"userSilience": "0",
"refStart": "1.53",
"userStart": "2.47",
"refEnd": "1.98",
"userEnd": "2.98"
}
]
}
finish 호출 시 서버에서 채널을 종료시키며 음성인식 결과를 리턴 합니다.
단어인식 결과(wordResult) 중 묵음(발성이 없는 구간) 및 등록되지 않은 단어의 경우 아래와 같이 값을 반환합니다.
HTTP Status Code | Error Message | Description |
---|---|---|
400 | Bad Request | 잘못된 입력 파라메터 |
401 | Unauthorized | 인증 실패 |
409 | Conflict | 중복 호출 |
429 | Too Many Request | 가용 음성인식 채널이 없음 |
500 | Internal Server Error | 음성인식 중 오류 발생 || 알 수 없는 오류 발생 |
Error Code | Error Message | Description |
---|---|---|
400 | 필수 파라메터가 들어오지 않았습니다. | 필수로 들어와야 하는 파라메터가 들어오지 않았습니다. |
400 | 잘못된 순서입니다. | 데이터(dataIndex)가 순서대로 들어오지 않았습니다. |
401 | 인증에 실패하였습니다. | API 인증정보 확인이 필요합니다. |
409 | 중복 호출 되었습니다.. | Finish가 중복 호출 되었습니다. |
429 | Too Many Request | 가용 음성인식 채널이 없음 |
500 | 내부 서버 오류가 발생하였습니다. | 음성인식 중 오류 발생 or 알 수 없는 오류 발생 |
500 | 음성 파일 저장에 실패하였습니다. | 클라우드 스토리지에 문제가 발생하여 업로드가 실패하였습니다. |
500 | 클라이언트 채널이 존재하지 않습니다. | Prepare 없이 Finish를 호출하였거나 일정시간이 지난 후 Finish를 호출한 경우 발생합니다. |
502 | 시간이 종료되었습니다. | 일정시간동안 데이터가 들어오지 않아 생성된 채널이 종료되었습니다. |
502 | 엔진 로딩에 실패하였습니다. | 엔진이 서버에 로딩이 완료되지 않았습니다. (서버 재시작 시 1분 정도의 로딩 시간이 필요합니다.) |