5. Data Collection

Collect acoustic measurements with data points

Overview

Data points allow you to collect acoustic measurements at specific time and frequency locations. Each data point automatically captures:

  • Time and frequency coordinates
  • Pitch (F0)
  • Formants (F1-F4) and bandwidths (B1-B4)
  • Intensity
  • HNR, CoG, Spectral Tilt, A1-P0
  • Labels from all annotation tiers at that time

This is ideal for building datasets for statistical analysis (vowel measurements, consonant measurements, etc.).
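To make the shape of one data point concrete, here is a minimal sketch of a record holding the values listed above. The class name and field names are hypothetical, chosen only for illustration; they are not the tool's internal format. Bandwidths, spectral tilt, and A1-P0 are omitted for brevity.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class DataPoint:
    """Hypothetical record for one data point (illustrative field names)."""
    time: float                        # seconds
    freq: float                        # Hz, where the user clicked
    pitch: Optional[float] = None      # F0 in Hz; None if not measurable
    f1: Optional[float] = None         # formants in Hz
    f2: Optional[float] = None
    f3: Optional[float] = None
    f4: Optional[float] = None
    intensity: Optional[float] = None  # dB
    hnr: Optional[float] = None        # dB
    cog: Optional[float] = None        # Hz
    labels: Dict[str, str] = field(default_factory=dict)  # tier name -> label


# Example values taken from the values panel shown later in this section.
point = DataPoint(time=0.452, freq=1234, pitch=234, f1=923, f2=1987, f3=2743,
                  intensity=68, hnr=1.3, cog=3986,
                  labels={"phones": "æ", "words": "cat"})
```

Measurements that could not be computed stay as None, which keeps "missing" distinct from a real value of zero.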

Why Use Data Points?

Manual approach (slow):

  1. Place cursor on vowel
  2. Read F1 from values panel
  3. Type F1 into spreadsheet
  4. Read F2 from values panel
  5. Type F2 into spreadsheet
  6. Repeat for 100+ vowels… 😰

Data points approach (fast):

  1. Double-click on each vowel
  2. Export all measurements to TSV
  3. Open in R, Python, Excel — ready for analysis! 🎉

Adding Data Points

  1. Enable overlays you want to measure (Pitch, Formants, Intensity)

  2. Double-click on the spectrogram at the point you want to measure

    [Figure: Adding a data point]

  3. A yellow vertical dashed line appears with a marker

  4. The point is added to the data points list

  5. Repeat for each measurement location

Tip

For vowel formant measurements, double-click in the middle of a vowel’s steady state (where F1 and F2 are stable).

Shortcuts:

  • Double-click on spectrogram — Add data point
  • Ctrl+Z — Undo adding point

Viewing Data Point Values

To inspect the measurements a data point has captured:

  1. Click on a data point line to select it

  2. The values panel updates to show measurements at that point:

    • Time: 0.452 s
    • Freq: 1234 Hz (where you clicked)
    • Pitch: 234 Hz
    • F1: 923 Hz
    • F2: 1987 Hz
    • F3: 2743 Hz
    • Intensity: 68 dB
    • HNR: 1.3 dB
    • CoG: 3986 Hz
    • Labels: phones=“æ”, words=“cat”

    [Figure: Data point values panel]

Note

The “Freq” value is where you clicked, not a measured acoustic property. It’s useful for reference but typically not used in analysis.

Moving Data Points

Adjust a data point’s position if you placed it slightly off:

  1. Click and drag a data point line left or right (time) or up/down (frequency)

  2. Release to finalize the position

  3. Values update automatically based on the new position

Tip

Use playback (Space) to verify you’re measuring the correct location. The best measurement location is usually the acoustic midpoint of the vowel, or the center of its steady state.

Removing Data Points

  1. Right-click on a data point line

  2. Select “Remove data point” from the context menu

  3. The point is deleted

Keyboard shortcuts:

  • Click data point + Delete key — Remove selected point (if implemented)
  • Ctrl+Z — Undo removal

Example Workflow: Measuring Vowels

Let’s collect formant data for all vowels in a sentence:

  1. Load audio with a sentence (e.g., “see two cats”)

  2. Enable Formants overlay (checkbox)

  3. Optionally annotate vowels with a “phones” tier (section 4)

  4. Zoom in on the first vowel (“ee” in “see”)

  5. Find the vowel midpoint (where formants are stable)

  6. Double-click at the midpoint

  7. A data point appears

  8. Repeat for each vowel:

    • [i] in “see” (high F2, low F1)
    • [u] in “two” (low F2, low F1)
    • [æ] in “cats” (high F1, mid F2)
  9. Export to TSV (section 6) for analysis

[Figure: Multiple data points on vowels]

Data Points and Annotations

Data points automatically capture labels from ALL annotation tiers at that time:

Example:

If you have tiers:

  • phones tier: interval “æ” from 0.4-0.6s
  • words tier: interval “cat” from 0.3-0.7s

and you add a data point at time 0.5 s, the exported TSV will include:

time    freq    pitch   f1      f2      phones  words
0.500   1500    234     698     1852    æ       cat

Tip

Annotate your audio first (section 4), then add data points. This way, each data point inherits contextual labels automatically.
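The label inheritance described in this section can be pictured as an interval lookup: for each tier, find the interval that contains the data point's time. The sketch below is an illustration under assumptions, not the tool's actual implementation. The function names are hypothetical, and treating intervals as half-open (start included, end excluded) is an assumed convention.

```python
from typing import Dict, List, Optional, Tuple

Interval = Tuple[float, float, str]  # (start_s, end_s, label)


def label_at(intervals: List[Interval], t: float) -> Optional[str]:
    """Return the label of the first interval containing time t, else None."""
    for start, end, label in intervals:
        if start <= t < end:  # assumed half-open convention
            return label
    return None


def labels_at(tiers: Dict[str, List[Interval]], t: float) -> Dict[str, str]:
    """Collect one label per tier at time t; an empty string means no label."""
    return {name: label_at(ivs, t) or "" for name, ivs in tiers.items()}


# The tiers from the example in this section.
tiers = {
    "phones": [(0.4, 0.6, "æ")],
    "words": [(0.3, 0.7, "cat")],
}
print(labels_at(tiers, 0.5))  # {'phones': 'æ', 'words': 'cat'}
```

A data point at 0.65 s would fall outside the "æ" interval but still inside "cat", so under this sketch it would get an empty phones label and words=“cat”.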

Use Cases

Vowel formant analysis:

  • Add data points at the midpoint of each vowel
  • Export TSV with F1, F2, F3, vowel label
  • Plot vowel space in R (F1 vs F2)
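If you prefer Python to R for the first pass, the following sketch averages F1 and F2 per vowel label using only the standard library. It assumes the column names from the TSV example in this section (f1, f2, phones). The sample rows are invented for illustration, and the handling of non-numeric cells is an assumption about how missing values might appear in an export.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Stand-in for an exported file; rows are invented for illustration.
SAMPLE_TSV = (
    "time\tfreq\tpitch\tf1\tf2\tphones\twords\n"
    "0.500\t1500\t234\t698\t1852\tæ\tcat\n"
    "1.200\t1500\t228\t710\t1830\tæ\tcat\n"
    "2.100\t1500\t240\t—\t—\ti\tsee\n"
)


def to_float(cell):
    """Convert a cell to float, returning None for blanks or placeholders."""
    try:
        return float(cell)
    except ValueError:
        return None


def mean_formants(tsv_text, label_column="phones"):
    """Return {label: (mean F1, mean F2)} over rows that have both formants."""
    groups = defaultdict(list)
    for row in csv.DictReader(io.StringIO(tsv_text), delimiter="\t"):
        f1, f2 = to_float(row["f1"]), to_float(row["f2"])
        if f1 is not None and f2 is not None:
            groups[row[label_column]].append((f1, f2))
    return {
        label: (mean(p[0] for p in pairs), mean(p[1] for p in pairs))
        for label, pairs in groups.items()
    }


print(mean_formants(SAMPLE_TSV))
```

To run this on a real export, read the file with open(path, encoding="utf-8") and pass its contents to mean_formants. The resulting per-vowel means are what you would plot as a vowel space (F1 against F2).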

VOT (Voice Onset Time) measurement:

  • Add data points at burst and voicing onset for stops
  • Export TSV with times
  • Calculate VOT = voicing_time - burst_time
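The VOT calculation can be sketched in a few lines of Python. This example assumes you placed exactly two data points per stop (burst first, then voicing onset) and that you have the exported times as a list of seconds. The function name is hypothetical. Pairing by time order only works when voicing follows the burst; prevoiced stops (negative VOT) would need explicit labels instead.

```python
from typing import List, Tuple


def vot_from_times(times: List[float]) -> List[Tuple[float, float, float]]:
    """Pair sorted times as (burst, voicing) and return (burst, voicing, VOT in ms)."""
    ordered = sorted(times)
    if len(ordered) % 2 != 0:
        raise ValueError("expected two data points per stop (burst and voicing onset)")
    results = []
    # Even positions are bursts, odd positions are the following voicing onsets.
    for burst, voicing in zip(ordered[0::2], ordered[1::2]):
        results.append((burst, voicing, (voicing - burst) * 1000.0))
    return results


# Invented times for two stops, in seconds.
for burst, voicing, vot_ms in vot_from_times([0.450, 0.512, 1.230, 1.295]):
    print(f"burst={burst:.3f} s  voicing={voicing:.3f} s  VOT={vot_ms:.1f} ms")
```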

Consonant acoustics:

  • Add data points at fricative midpoints
  • Export CoG, spectral tilt
  • Compare sibilants [s] vs [ʃ]

Practice Exercises

  1. Collect formants for 3 different vowels
    • Add one data point per vowel
    • Check the values panel to see F1 and F2
    • Note which vowel has highest F2
  2. Annotate + measure
    • Create a “vowels” tier
    • Annotate 3 vowels (e.g., “a”, “i”, “u”)
    • Add data points in the middle of each
    • Verify that labels appear in data point info
  3. Practice moving points
    • Add a data point slightly off target
    • Drag it to the correct position
    • Observe how values change
  4. Practice undo/redo
    • Add 3 data points
    • Undo them all (Ctrl+Z × 3)
    • Redo them (Ctrl+Y × 3)

Challenge: Record yourself saying 5 vowels: [i], [ɛ], [a], [ɔ], [u]. Add data points at the midpoint of each. Export to TSV (next section) and plot F1 vs F2 in Excel or R.

Troubleshooting

Double-click adds boundary instead of data point:

  • Ensure you’re clicking on the spectrogram, not on an annotation tier
  • Annotation tiers are below the spectrogram; click in the colored frequency display

Data point appears but values show “—”:

  • Enable the relevant overlays (Pitch, Formants, etc.)
  • Ensure the WASM backend is loaded (check the backend selector)
  • Wait for analysis to compute (1-2 seconds after zoom)

Can’t move data point:

  • Ensure you’re clicking and dragging the line, not near it
  • Try clicking on the marker circle at the top of the line

Data point values seem wrong:

  • Check you clicked at the correct time and frequency
  • Verify the acoustic overlays show reasonable values
  • For formants, ensure you’re clicking in a vowel, not a consonant

Too many data points, display is cluttered:

  • Remove unnecessary points (right-click → Remove)
  • Zoom out to see overall distribution
  • Export and start fresh if needed

Tip: Quick Copy to Clipboard

Press Ctrl+C (Windows/Linux) or Cmd+C (Mac) to instantly copy all data points as TSV to your clipboard. Then paste directly into Excel, R, or any text editor. No export dialog needed!

What’s Next?

Now that you’ve collected data points and created annotations, let’s learn how to export everything for further analysis.

Next: 6. Exporting

