Speech analysis APIs for developers

DeepAffects enables developers to analyze conversational audio by applying powerful machine learning models offered as a set of easy to use REST APIs.

Conversational Metrics API:
Insights from audio

Comprehensive speech analysis of conversations - meetings, customer support calls, interviews, earnings-call. Measure intent, cross-talk, number of questions asked, talk-to-listen ratio, pitch, tone, speech disfluency & more to generate actionable insights from multi-speaker conversations.

Speaker Diarization API:
Who spoke when?

Speaker recognition/diarization is the identification of an individual person based on characteristics found in the unique voice qualities. In an audio recoding with multiple speakers (conference call, dialogs etc.), the Diarization API identifies the speaker at precisely the time they spoke during the conversation. On the left is an audio recording of a debate, the image shows the cluster generated based on the speech pattern and precise time the speaker participated in the conversation.

Emotion Recognition API:
If Emotions Could Talk

Emotion Recognition API identifies emotions from paralinguistic properties of speech (without text based references). Some of the emotions extracted are anger, stress, disgust, etc. Below are the identified emotion metrics extracted from the given audio clip.

Denoising API:
Signal vs. Noise

Media recordings are susceptible to noise. Noise embedded in audio files can be random or white noise. Denoising algorithms are used to remove the noise. Look and listen at the sample audio clips with corresponding outputs displayed below:

play_circle_filled
pause_circle_filled
Clean
volume_down
volume_up
volume_off
play_circle_filled
pause_circle_filled
Noisy
volume_down
volume_up
volume_off
play_circle_filled
pause_circle_filled
Denoised
volume_down
volume_up
volume_off

Custom API:
One Size Doesn't Fit All

For custom use-cases, we provide a solution that learns to recognize patterns in your data. This means that you can not only apply our pre-trained models to your specific use case, but you can train on the most relevant data available.

Developer Portal

Easy step-by-step guide to integrate the speech & text api for developers. Integrate with secure & scalable speech & text APIs, humanize your communication data and put insights to work.

Back to Top