CUSTOM GRADEVOICE & SPEECH

Wake-Word & Command Corpus

Short-form utterance corpora for wake-word detection, hotword training, and voice-interface intent recognition.

Languages
Multi · scoped
Quality
Controlled conditions
Availability
Sample clips on request

[ OVERVIEW ]

Short-form utterance corpora built specifically for voice-interface models: wake words, hotwords, command phrases, and directed-dialog turns. Recordings are captured under controlled conditions with defined acoustic variation (quiet, ambient, distant-field, and adversarial noise). Every utterance includes speaker demographics, environment class, and signal-to-noise metadata. Scoped per wake-word, per language, or across demographic distributions your product team needs to cover.

[ KEY HIGHLIGHTS ]

  • Controlled acoustic conditions across quiet, ambient, distant-field, and noisy environments
  • Speaker demographic coverage by age, gender, accent, and language
  • Signal-to-noise and reverberation metadata per utterance
  • Custom wake-word and hotword coverage scoped to your product
  • Directed-dialog commands with intent classification labels
  • Far-field and adversarial-noise subsets available
  • Licensed per-wake-word or as the full voice-interface corpus

[ TECHNICAL SPECIFICATIONS ]

Files
Mono WAV, 16-48 kHz, 16-bit, with environment class per recording
Transcripts
JSON with utterance text, speaker metadata, environment class, SNR
Annotations
Wake-word / hotword / command tag · intent classification · environment label
Licensing
Commercial training rights · per-wake-word or full-corpus · demographic distributions scoped per project

More from the catalog.

Explore the full catalog, or scope a custom build matched to your brief.