This blog post explores various Sound Match options in Textivate, each with a different focus and level of support.
1) L2 audio >> L1 text
In the example below, the prompt is French audio (TTS), and students have to match it with the English text.
To set this up: