The natural language speech audio to be processed.
A single request can contain up to 2 minutes of speech audio data.
The [transcribed
text][google.cloud.dialogflow.cx.v3.QueryResult.transcript] cannot contain
more than 256 bytes.
For non-streaming audio detect intent, both config and audio must be
provided.
For streaming audio detect intent, config must be provided in
the first request and audio must be provided in all following requests.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[[["\u003cp\u003eThis webpage provides reference documentation for the \u003ccode\u003eAudioInput\u003c/code\u003e class within the Google Cloud Dialogflow CX v3 API, with the latest version being 2.23.0.\u003c/p\u003e\n"],["\u003cp\u003eThe \u003ccode\u003eAudioInput\u003c/code\u003e class, part of the \u003ccode\u003eGoogle.Cloud.Dialogflow.Cx.V3\u003c/code\u003e namespace, is used to represent and process natural speech audio.\u003c/p\u003e\n"],["\u003cp\u003eThe class \u003ccode\u003eAudioInput\u003c/code\u003e includes properties such as \u003ccode\u003eAudio\u003c/code\u003e for the speech data as \u003ccode\u003eByteString\u003c/code\u003e and \u003ccode\u003eConfig\u003c/code\u003e to dictate how speech is processed using \u003ccode\u003eInputAudioConfig\u003c/code\u003e , and has a range of versions from 1.0.0 to the latest 2.23.0.\u003c/p\u003e\n"],["\u003cp\u003eThe \u003ccode\u003eAudioInput\u003c/code\u003e Class has an \u003ccode\u003eAudioInput()\u003c/code\u003e constructor which can take another \u003ccode\u003eAudioInput\u003c/code\u003e as a parameter.\u003c/p\u003e\n"]]],[],null,[]]