modelling

Depression detection based on speech data

Nadiia Novakova published on

22 min, 4234 words

In this topic I would like to show how to manage a dataset with many features (especially numeric data with completely unclear meaning and influence on whole dataset)

The dataset contains speech features and clinical variables from participants of a depression related study. Based on speech recordings, vocal features have been derived from different categories. Each feature contains a tag _{pos,neg}, which refers to the vocal task it was extracted from. Clinical and demographic variables of participants can be found at the beginning. The study was meant to show a relation between voice patterns and depression scale (variable ADS).

Read More