Adaptive Multi-Rate Narrowband codec. sample_rate_hertz must be 8000.
AmrWb
Adaptive Multi-Rate Wideband codec. sample_rate_hertz must be 16000.
Flac
FLAC (Free Lossless Audio
Codec) is the recommended encoding because it is lossless (therefore
recognition is not compromised) and requires only about half the
bandwidth of LINEAR16. FLAC stream encoding supports 16-bit and
24-bit samples, however, not all fields in STREAMINFO are supported.
Linear16
Uncompressed 16-bit signed little-endian samples (Linear PCM).
Mulaw
8-bit samples that compand 14-bit audio samples using G.711 PCMU/mu-law.
OggOpus
Opus encoded audio frames in Ogg container
(OggOpus).
sample_rate_hertz must be 16000.
SpeexWithHeaderByte
Although the use of lossy encodings is not recommended, if a very low
bitrate encoding is required, OGG_OPUS is highly preferred over
Speex encoding. The Speex encoding supported by
Dialogflow API has a header byte in each block, as in MIME type
audio/x-speex-with-header-byte.
It is a variant of the RTP Speex encoding defined in
RFC 5574.
The stream is a sequence of blocks, one block per RTP packet. Each block
starts with a byte containing the length of the block, in bytes, followed
by one or more frames of Speex data, padded to an integral number of
bytes (octets) as specified in RFC 5574. In other words, each RTP header
is replaced with a single byte containing the block length. Only Speex
wideband is supported. sample_rate_hertz must be 16000.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[[["\u003cp\u003eThe latest version available for the Dialogflow v3 API's AudioEncoding is 2.23.0, with a range of previous versions listed, down to version 1.0.0.\u003c/p\u003e\n"],["\u003cp\u003eThe \u003ccode\u003eAudioEncoding\u003c/code\u003e enum defines the audio encoding types supported by the Dialogflow v3 API, for conversational queries, including \u003ccode\u003eAmr\u003c/code\u003e, \u003ccode\u003eAmrWb\u003c/code\u003e, \u003ccode\u003eFlac\u003c/code\u003e, \u003ccode\u003eLinear16\u003c/code\u003e, \u003ccode\u003eMulaw\u003c/code\u003e, \u003ccode\u003eOggOpus\u003c/code\u003e, \u003ccode\u003eSpeexWithHeaderByte\u003c/code\u003e, and \u003ccode\u003eUnspecified\u003c/code\u003e.\u003c/p\u003e\n"],["\u003cp\u003eDifferent audio encodings, such as \u003ccode\u003eAmr\u003c/code\u003e, \u003ccode\u003eAmrWb\u003c/code\u003e, \u003ccode\u003eOggOpus\u003c/code\u003e, and \u003ccode\u003eSpeexWithHeaderByte\u003c/code\u003e, require specific \u003ccode\u003esample_rate_hertz\u003c/code\u003e values, as outlined in their individual descriptions.\u003c/p\u003e\n"],["\u003cp\u003e\u003ccode\u003eFlac\u003c/code\u003e is the recommended encoding due to it being lossless and only using about half the bandwidth of \u003ccode\u003eLinear16\u003c/code\u003e, making it ideal for recognition, while \u003ccode\u003eOggOpus\u003c/code\u003e is recommended over \u003ccode\u003eSpeexWithHeaderByte\u003c/code\u003e if a low bitrate is required.\u003c/p\u003e\n"],["\u003cp\u003eThe Dialogflow API supports a variety of audio encoding types which are detailed in the Cloud Speech API documentation.\u003c/p\u003e\n"]]],[],null,[]]