Tools for Speech Analysis Julia Hirschberg CS4995/6998 Thanks to
21 Slides369.50 KB
Tools for Speech Analysis Julia Hirschberg CS4995/6998 Thanks to Jean-Philippe Goldman, Fadi Biadsy
Goals Create stimuli for a perception experiment Record subjects for a production experiment Analyze data from an experiment or from a natural speech corpus Extract features from speech data for a machine learning experiment 2
Many Software Options Goldwave (audio editor) Esps Xwaves (routines visual.) Praat (speech analysis) Wavesurfer (speech editor) Transcriber (annotation tool) Matlab (general purpose soft) OGI speech tools (routines app. dev.) winpitch, pitchworks, phonedit, cooledit . 3
Links www.goldwave.com www.speech.kth.se/software/#esps www.praat.org www.speech.kth.se/software/#wavesurfer http://cslu.cse.ogi.edu/toolkit/ www.mathworks.com (Matlab) www.lpl.univ-aix.fr/ sqlab/ (phonedit) http://www.kvraudio.com/get/696.html (PitchWorks) www.winpitch.com (WinPitch) http://www.adobe.com/special/products/audition/ syntrillium.html (CoolEdit Audition) 4
Our Choice: Praat Developed by Paul Boersma and David Weenink at the Institute of Phonetic Sciences, University of Amsterdam General purpose speech tool : editing, segmentation and labeling, prosodic manipulation, many tutorials, large user community, yahoo group 5
6
File Management Recording files and saving them – New menu Opening files – Read menu Long and short sound files Other file types – Write menu – Exercise: Record a file with your own name, play it to check, call it ‘ your name ’, save it to list, write it to a .wav file on disk, remove it from the objects list, read it back in 7
Editing Options from Objects Window Select and edit your name file Spectrum: – Show a spectral slice – Show a spectrogram Pitch: – Show pitch – Check the settings, change the range – Get pitch information: get pitch, get min/max pitch Intensity: – Get intensity information: similar to pitch functions – Check the settings Formant: Display 8
Modifying the Data Changing the pitch contour of your name file: Go to To manipulation Edit the new object Pitch Stylize pitch (2st) Modify pitch by dragging points up and down Modify duration: – Add points in duration tier – Drag points up and down To save: File - Publish resynthesis 9
Annotation: Textgrids From objects window, w/ sound file selected – Annotate To textgrid – Point vs. interval tiers Add a point tier and an interval tier and insert some labels NB: remember to select the interval or point first in the waveform or spectrogram before trying to insert a label 10
Scripting From history: – Praat new Praatscript Edit Paste history – NB: you can run all or part of the script Writing scripts Modifying existing scripts: – Tutorials, scripts, resources, user groups, search 11
Sample Praat Script # This script will create a new text-grid for a wav file form Make a text-grid for a .wav file comment Source Directory? sentence Directory C:\Documents and Settings\julila\My Documents\ comment File name? sentence Filename comment Tier Name? sentence Tier endform Read from file. 'directory ‘ ‘filename ' stem left (filename ,length(filename )-4) select Sound 'stem ' To TextGrid. 'tier ' 'tier ‘ # tier names, which tiers are point tiers Write to text file. 'directory '\'stem '.TextGrid Remove
Task 1 Read in the file ‘mummy.wav’ Edit it Zoom in on the most prominent Select the entire contour – Display the pitch and intensity contours – What is the minimum pitch? Maximum? Mean? – What is the minimum intensity? Maximum? Mean?
Task2 Record a statement Record a yes-no question Record a wh-question What are the similarities? Differences?
Task3 Record something in a very loud voice, to produce clipping, and see what the waveform looks like – how do you identify clipping? Avoid it?
Task4 Record a file using falling intonation Modify it to produce a rising intonational contour Edit the new contour to – Raise the pitch range (select the contour and use ‘shift pitch frequencies’ – Stylize the pitch contour
Task 5: Pitch contour cloning Replace the pitch contour in beach.wav with the contour in speech.wav – Create a manipulation object for beach.wav – Extract the pitch tier – Create a manipulation object for speech.wav – Select the pitch tier of beach.wav and the manipulation object for speech.wav and click ‘Replace pitch tier’ – Select the manipulation object for speech.wav and click ‘Get resynthesis ’ – Compare the original file and the new file
Task6 Record “My mama moonlights in Memphis” – As angry speech – As sad speech – As happy speech For each token answer the following: – What is the mean pitch? Maximum? – What is the mean intensity? Maximum? – What is the duration? – Do you see any differences in the F0 contour?
Task 7 Edit mummy.wav Convert Change Gender Filter filter (pass) Hann band – Find a pass band that masks the words but retains the intonation – Find a pass band that masks the intonation but retains the words
Help Online help, FAQ, manual Links from http://www.praat.org Additional tutorials, scripts, resources, user groups 20
Next Class Report on your results from the seven tasks 21