[26] audioTimeRange would still be interesting for .volatileResults in SpeechTranscriber

While experimenting with the new SpeechTranscriber, I found that if I create it like this:

let transcriber = SpeechTranscriber(
    locale: locale,
    transcriptionOptions: [],
    reportingOptions: [.volatileResults],
    attributeOptions: [.audioTimeRange]
)

only the final result has audio time ranges, not the volatile results.

Is this a performance consideration? If there is no performance problem, it would be nice to have the option to also get speech time ranges for volatile results. I'm not presenting the volatile text in the UI at all. I was just trying to keep statistics about the noise level during speech and non-speech, so that I can determine when the noise level has stayed below the noise floor for a while.

The goal here was to finalize the recording automatically when the noise level indicates that the user has finished speaking.
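A minimal sketch of that auto-finalize decision, under some assumptions: it looks only at the gap since the last speech time range rather than at a measured noise level, and the SpeechSpan type, the shouldFinalize function, and the 1.5 second threshold are all hypothetical names and values chosen for illustration. In a real app the spans would be derived from the audioTimeRange attributes on the transcriber's results.

```swift
import Foundation

/// A span of detected speech, in seconds from the start of the audio.
/// Hypothetical type: real values would come from audioTimeRange attributes.
struct SpeechSpan {
    let start: Double
    let end: Double
}

/// Returns true when no speech has been observed for at least
/// `minimumSilence` seconds before `currentTime`.
func shouldFinalize(spans: [SpeechSpan], currentTime: Double, minimumSilence: Double) -> Bool {
    // Without any speech so far there is nothing to finalize.
    guard let lastSpeechEnd = spans.map(\.end).max() else { return false }
    return currentTime - lastSpeechEnd >= minimumSilence
}

// Example: two speech spans, the last one ending at 2.5 s.
let spans = [SpeechSpan(start: 0.2, end: 1.1), SpeechSpan(start: 1.4, end: 2.5)]
let stillTalking = shouldFinalize(spans: spans, currentTime: 3.0, minimumSilence: 1.5)
let finished = shouldFinalize(spans: spans, currentTime: 4.5, minimumSilence: 1.5)
```

Taking the maximum end time instead of the last array element keeps the check correct even if spans arrive out of order, which may happen when volatile results are later revised.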

Turns out it was my bad: I had a bug in how I was looping through the runs of the AttributedString. I have now found all the audioTimeRanges, in the volatile results as well.
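For anyone hitting a similar bug, here is a rough sketch of walking every run and collecting a per-run attribute. To keep it runnable with Foundation alone, it uses a hypothetical stand-in key (DemoTimeRangeKey, with values in seconds) instead of the Speech framework's audioTimeRange attribute, so treat it as an illustration of the loop, not of the real attribute type.

```swift
import Foundation

// Hypothetical stand-in for the audioTimeRange attribute, so this sketch
// does not depend on the Speech framework. Values are seconds.
enum DemoTimeRangeKey: AttributedStringKey {
    typealias Value = ClosedRange<Double>
    static let name = "demoTimeRange"
}

/// Collects the stand-in attribute from every run that carries it.
func collectTimeRanges(in text: AttributedString) -> [ClosedRange<Double>] {
    var ranges: [ClosedRange<Double>] = []
    for run in text.runs {
        // Skip runs without the attribute instead of stopping at the first gap.
        if let range = run[DemoTimeRangeKey.self] {
            ranges.append(range)
        }
    }
    return ranges
}

// Build "Hello world" where only the two words carry a time range.
var hello = AttributedString("Hello")
hello[DemoTimeRangeKey.self] = 0.0...0.4
let space = AttributedString(" ")
var world = AttributedString("world")
world[DemoTimeRangeKey.self] = 0.5...0.9

var text = hello
text.append(space)
text.append(world)

let collected = collectTimeRanges(in: text)
```

The point of the sketch is that runs without the attribute (here, the space) can sit between runs that have it, so the loop has to visit all runs rather than inspect only the first one or stop at the first nil.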

Consider using the SpeechDetector module in conjunction with SpeechTranscriber. SpeechDetector performs a similar voice activity detection function and integrates with SpeechTranscriber.
