Some context: I am doing a college project for a machine learning class, and I am trying to analyze several thousand short audio clips of Supreme Court testimony. Each clip contains only one voice at a time and no background noise, so the audio is very clean. I'm planning to use the Praat voice report (through parselmouth) to extract features.
I understand that this call makes the voice report:
voice_report_str = parselmouth.praat.call([sound, pitch, pulses], "Voice report", 0.0, 0.0, 75, 600, 1.3, 1.6, 0.03, 0.45)
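To make the question concrete, here is a minimal sketch of how I currently wrap that call. The names I attach to the eight numbers are only my reading of Praat's "Voice report" dialog, so treat them as assumptions rather than confirmed. The helper names (`VOICE_REPORT_DEFAULTS`, `voice_report_args`, `voice_report`) are made up for this post, and I'm assuming `pitch` and `pulses` should be built with the same pitch floor and ceiling that the report uses.

```python
# Values copied from the call above. The key names are my guess at what each
# position means in Praat's "Voice report" dialog (an assumption, not verified).
VOICE_REPORT_DEFAULTS = {
    "start_time": 0.0,            # start and end both 0.0: seems to mean "whole clip"
    "end_time": 0.0,
    "pitch_floor": 75,            # Hz
    "pitch_ceiling": 600,         # Hz
    "max_period_factor": 1.3,
    "max_amplitude_factor": 1.6,
    "silence_threshold": 0.03,
    "voicing_threshold": 0.45,
}


def voice_report_args(**overrides):
    """Return the eight positional arguments in order, with optional overrides."""
    unknown = set(overrides) - set(VOICE_REPORT_DEFAULTS)
    if unknown:
        raise TypeError(f"unknown voice report parameter(s): {sorted(unknown)}")
    merged = {**VOICE_REPORT_DEFAULTS, **overrides}
    # Iterate over the defaults so the positional order never changes.
    return tuple(merged[name] for name in VOICE_REPORT_DEFAULTS)


def voice_report(path, **overrides):
    """Build sound, pitch and pulses for one file and return the report string."""
    # Imported here so the argument helper above works without parselmouth.
    import parselmouth
    from parselmouth.praat import call

    args = voice_report_args(**overrides)
    pitch_floor, pitch_ceiling = args[2], args[3]

    sound = parselmouth.Sound(path)
    # 0.0 lets Praat choose the time step automatically.
    pitch = call(sound, "To Pitch", 0.0, pitch_floor, pitch_ceiling)
    pulses = call([sound, pitch], "To PointProcess (cc)")
    return call([sound, pitch, pulses], "Voice report", *args)
```

With this, `voice_report("clip_0001.wav")` (a hypothetical file name) should reproduce the call above, and something like `voice_report("clip_0001.wav", pitch_ceiling=300)` would change one value for a particular clip if per-file tuning turns out to be necessary.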
What are all those parameters? Are they sort of standard, or should I calculate them somehow for each file?
Thanks!