Google api Google云文本到语音-Timepoint返回空数组

Google api Google云文本到语音-Timepoint返回空数组,google-api,google-cloud-functions,text-to-speech,Google Api,Google Cloud Functions,Text To Speech,我正在使用并且希望使用时间点,以便在正确的时间显示句子中的单词。(如字幕)。不幸的是,我不能让这个工作 HTTP请求 POST https://texttospeech.googleapis.com/v1beta1/text:synthesize 请求正文 "input": { "ssml": "<speak>Hello World</speak>" }, "voice

我正在使用并且希望使用时间点,以便在正确的时间显示句子中的单词。(如字幕)。不幸的是,我不能让这个工作

HTTP请求

POST https://texttospeech.googleapis.com/v1beta1/text:synthesize
请求正文

  "input": {
      "ssml": "<speak>Hello World</speak>"
      },
  "voice": {
    "languageCode": "nl-NL",
    "name": "nl-NL-Wavenet-E",
    "ssmlGender": "FEMALE"
  },
  "audioConfig": {
    "audioEncoding": "MP3"
  },
  "enableTimePointing": [
    "SSML_MARK"
  ]
}
{
    "audioContent": "base64"
    "timepoints": [],
    "audioConfig": {
        "audioEncoding": "MP3",
        "speakingRate": 1,
        "pitch": 0,
        "volumeGainDb": 0,
        "sampleRateHertz": 24000,
        "effectsProfileId": []
    }
}

我希望返回一个Timepoint对象,但正如您所看到的,它返回一个空数组。

要获取timepoints,只需在输入上添加
。下面是一个使用请求主体的示例

请求机构:

{
  "input": {
      "ssml": "<speak><mark name=\"1st\"/>Hello <mark name=\"2nd\"/>world</speak>"
      },
  "voice": {
    "languageCode": "nl-NL",
    "name": "nl-NL-Wavenet-E",
    "ssmlGender": "FEMALE"
  },
  "audioConfig": {
    "audioEncoding": "MP3"
  },
  "enableTimePointing": [
    "SSML_MARK"
  ]
}
{
“输入”:{
“ssml”:“你好,世界”
},
“声音”:{
“语言代码”:“nl nl”,
“名称”:“nl-nl-Wavenet-E”,
“女性”:“女性”
},
“audioConfig”:{
“音频编码”:“MP3”
},
“enableTimePointing”:[
“SSML_标记”
]
}
我添加了
来创建两个标记,只是为了演示如何添加多个标记。如果您只需要一个标记,只需删除第二个标记,响应也应显示一个标记

响应(我刚刚包括base64的一个片段):