This page is still small but will eventually be a central place for all things useful for other linguists. It’ll list software I’ve found useful, book recommendations, and links to resources I’ve developed and other helpful pages I write. For now, I can list a couple things.
My handouts, tutorials, and workshops
I’m currently giving a series of workshops on how to use R which will include a variety of topics. I have included PDFs and additional information on each installment of this series.
This tutorial walks you through writing a Praat script that extracts formant measurements from vowels. If you’ve never worked with Praat scripting but want to work with vowels, this might be a good starting point.
Vowel plots in R tutorials
This is a multi-part tutorial on how to make typical vowel plots in R. Part 1 shows how to plot single-point measurements as scatter plots and serves as a mild introduction to ggplot2. Part 2 shows how to plot trajectories, both in the F1-F2 space and in a Praat-like time-Hz space, and is a bit of an introduction to tidyverse as well.
I’m no expert, but I have given a workshop on how grad students can increase their online presence and make themselves more googleable, based in large part on ImpactStory’s fantastic 30-day challenge, which you can read here.
Last year I gave a workshop on Excel and ended up producing a long handout that goes from the very basics to relatively tricky techniques. The link above will take you to a blog post that summarizes the workshop, and you can also find the handout itself.
Here is a list of resources I’ve found for R. I’ve gone through some of them and others are on my to-do list. These are in no particular order.
The website for Tidyverse is a great go-to place for learning how to use tidyr, and many other packages.
R for Data Science by Garrett Grolemund & Hadley Wickham is a fantastic overview of tidyverse functions.
Intro to Tidyverse by David Robinson.
R Packages by Hadley Wickham.
Hands-On Programming with R by Garrett Grolemund & Hadley Wickham for writing functions and simulations. Haven’t read it, but it looks good.
ggplot2 by Hadley Wickham is a comprehensive resource for learning all the ins and outs of ggplot2.
Text Mining with R by Julia Silge & David Robinson. Haven’t read it, but it looks great.
Elegant, flexible, and fast dynamic report generation with R by Yihui Xie is a great resource for RMarkdown.
bookdown: Authoring Books and Technical Documents with R Markdown by Yihui Xie. See an introduction to Bookdown by RStudio here.
Visualizing text data with ggplot2 by Colin Fay.
r-statistics.co by Selva Prabhakaran, which has great tutorials on R itself, ggplot2, and advanced statistical modeling.
Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing by Justin Matejka and George Fitzmaurice. This went viral in some circles and shows that you can get the exact same summary statistics with wildly different distributions. Very cool.
Studying Pronunciation Changes with gamms by Josef Fruehwald.
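To make the point of the “Same Stats, Different Graphs” entry above concrete, here is a minimal Python sketch. It does not use the simulated annealing from that paper; it just linearly rescales two made-up datasets (the names `evenly_spread` and `two_clusters` are invented for illustration) so they end up with the same mean and standard deviation while keeping very different shapes.

```python
import statistics


def rescale(values, target_mean, target_sd):
    """Linearly rescale values to have the given mean and sample SD."""
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)
    return [target_mean + (v - mean) * target_sd / sd for v in values]


# Two made-up shapes: one evenly spread, one split into two tight clusters.
evenly_spread = [float(i) for i in range(1, 11)]
two_clusters = [1.0, 1.1, 1.2, 1.3, 1.4, 8.6, 8.7, 8.8, 8.9, 9.0]

a = rescale(evenly_spread, 50.0, 10.0)
b = rescale(two_clusters, 50.0, 10.0)

for name, data in [("evenly spread", a), ("two clusters", b)]:
    # Same mean and SD (up to rounding), but clearly different values.
    print(name,
          round(statistics.fmean(data), 2),
          round(statistics.stdev(data), 2),
          [round(v, 1) for v in data])
```

The summary numbers match, but printing or plotting the values shows one dataset is spread out and the other is bimodal, which is why it pays to look at the data and not just the statistics.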
Phonetics on Speed: Praat Scripting Tutorial by Jörg Mayer is what I find myself coming back to again and again.
University of Washington Phonetics Lab has tutorials and scripts.
And I’ve written a tutorial on writing a script for basic automatic formant extraction.
This is my personal favorite. It’s actually a whole collection of tools available through a web interface from Dartmouth College. It can transcribe, align, and extract formants from your (English) audio files all in one go. Previously, its forced aligner was built using Prosody-Lab, but it now uses the Montreal Forced Aligner (see below).
This is probably the most well-known forced aligner. It’s open source and you can download it to your own computer from Joe Fruehwald’s GitHub page. Or if you’d prefer, you can use UPenn’s web interface instead.
This is a new forced aligner that I heard about for the first time at the 2017 LSA conference. It is fundamentally different from the others in that it is built on a toolkit called Kaldi. I don’t know much beyond that and I haven’t used it yet (though I intend to soon).
According to their website, this “is a set of Python and shell scripts for performing automated alignment of text to audio of speech using Hidden Markov Models.” This is software available through McGill University that actually allows you to train your own acoustic model (e.g. on a non-English audio corpus). I haven’t done this yet, but I’ve been meaning to start.
This is a software package with several functions including forced alignment in several languages. Of the aligners you can download to your computer, this might be one of the easier ones to use.
This is another web interface with multiple functions including a forced aligner for several languages.
This advertises itself as a “robust yet lenient forced aligner built on Kaldi.” It’s easy to download and use and produces what appear to be very good word-level alignments of a provided transcript. It even ignored the interviewer’s voice in the file I tried. The output is a .csv file, so I’m not sure how to turn that into a TextGrid, and if you need phoneme-level acoustic measurements, a word-level transcription isn’t going to work.
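For what it’s worth, turning word-level times into a TextGrid can be sketched in a few lines of Python. This is a rough sketch, not something I’ve tested on real aligner output: it assumes each CSV row is `word, aligned word, start, end` with times in seconds (check your own file, since that column layout is a guess), and the function names, the `words` tier name, and the sample data are all made up for illustration. Gaps between words are filled with empty intervals because a Praat interval tier has to cover the whole time span.

```python
import csv
import io


def read_word_rows(csv_text):
    """Parse rows assumed to be (word, aligned_word, start, end)."""
    words = []
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) < 4 or not row[2] or not row[3]:
            continue  # skip words that have no timestamps
        try:
            words.append((row[0], float(row[2]), float(row[3])))
        except ValueError:
            continue  # skip a header row or other non-numeric times
    return sorted(words, key=lambda w: w[1])


def fill_gaps(words, duration):
    """Build contiguous (start, end, text) intervals, using "" for gaps."""
    intervals, cursor = [], 0.0
    for text, start, end in words:
        start = max(start, cursor)  # clip any overlap with the previous word
        if end <= start:
            continue
        if start > cursor:
            intervals.append((cursor, start, ""))
        intervals.append((start, end, text))
        cursor = end
    if duration > cursor:
        intervals.append((cursor, duration, ""))
    return intervals


def to_textgrid(intervals, tier_name="words"):
    """Write one interval tier in Praat's long TextGrid text format."""
    xmax = intervals[-1][1]
    lines = [
        'File type = "ooTextFile"',
        'Object class = "TextGrid"',
        "",
        "xmin = 0",
        f"xmax = {xmax}",
        "tiers? <exists>",
        "size = 1",
        "item []:",
        "    item [1]:",
        '        class = "IntervalTier"',
        f'        name = "{tier_name}"',
        "        xmin = 0",
        f"        xmax = {xmax}",
        f"        intervals: size = {len(intervals)}",
    ]
    for i, (start, end, text) in enumerate(intervals, 1):
        escaped = text.replace('"', '""')  # Praat doubles quotes inside text
        lines += [
            f"        intervals [{i}]:",
            f"            xmin = {start}",
            f"            xmax = {end}",
            f'            text = "{escaped}"',
        ]
    return "\n".join(lines) + "\n"


# Made-up sample standing in for the aligner's CSV output.
sample_csv = "hello,hello,0.25,0.6\nworld,world,0.75,1.1\n"
intervals = fill_gaps(read_word_rows(sample_csv), duration=1.5)
textgrid = to_textgrid(intervals)
print(textgrid)
```

You’d pass the real audio duration as `duration` and save the string to a `.TextGrid` file. This still only gives you a word tier, so the caveat about phoneme-level measurements stands.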
This is an interesting site that has an interactive simulator of the vocal tract. You can click around and make different vowels and consonants. Pretty fun resource for teaching how speech works.
This site automatically creates a new conlang, based on parameters that you specify. The free web version allows you to add whatever vowels and consonants you’d like to include, and it’ll create a full language: a language name; IPA chart for vowels and consonants; phonotactics; phonological rules; and paradigms for nominal morphology, definite and indefinite articles, personal pronouns, and verb conjugations; derivational morphology; and a lexicon of over 200 words. For $19 you can download the software and get a lexicon of 2000 words, derivational words, random semantic overlaps with natural languages, and the ability to customize orthography, syllable structure, and phonological rules. In addition to just being kinda fun, this is a super useful resource for creating homework assignments for students.
This iPhone app has what they call an “elaborated” IPA chart with lots of extra places and manners of articulation, complete with audio clips of all the sounds. You can play a game where it’ll play a sound and you can guess what you heard. It’s just fun to see things like a voiced uvular affricate [ɢʁ] or a linguolabial fricative [θ̼] on an IPA chart. Credits to University of Victoria linguistics and John Esling’s “Phonetic Notation” (chapter 18 of the Handbook of Phonetic Sciences, 2nd ed.).
“The EMU-webApp is a fully fledged browser-based labeling and correction tool that offers a multitude of labeling and visualization features.” I haven’t given this enough time to learn to use it properly, but it seems very helpful.