Ozen-web
Web-Based Acoustic Analysis & Annotation
Ozen-web
Browser-based acoustic analysis and annotation
A browser-based tool for phonetic research, built with Svelte and WebAssembly. Analyze pitch, formants, intensity, and more, with various supporting backends.
Key Features
Phonetic Analysis
Powered by praatfan_rust WebAssembly to provide acoustic measurements: pitch (F0), formants (F1-F4), intensity, HNR, center of gravity, spectral tilt, and more.
Interactive Annotation
Full TextGrid support with multi-tier annotations. Add boundaries, edit labels, snap to higher layer boundaries, with keyboard shortcuts for efficient workflow.
Mobile-Optimized
An alternative touch-friendly viewer-only front end with pinch-to-zoom, drag-to-select, and responsive layout. Aimed for phone-based access to phonetic analysis and embedding in presentations.
Works Offline
Runs entirely in a browser. No external server required, no data uploaded.
Data Collection
Double-click to add points of interest for data collection with acoustic measurements. Export to TSV with all values and annotation labels for statistical analysis, or ctrl-C / command-C to copy to clipboard.
Highly Embeddable
Embed the viewer in Quarto documents, R Markdown, Jupyter notebooks, or any web page. URL parameters for pre-configuration and data URL support for self-contained embeds.
Quick Start
Use the live version
Or download the
docs/live/directory to directory on your server,live/Important: Ozen-web will not work if you just open a local html file in your browser, it has to be served from a web server. The web server can be local though (python has
python -m http.server). If this sounds complicated, try the python interface
See the Getting Started Guide for detailed installation instructions.
Live Demo
Try the interactive viewer below. This example shows a spectrogram with pitch and formant overlays:
The embedded viewer requires audio to be loaded. Click βLoad Audioβ, record yourself, or drag a WAV file to explore features.
Use Cases
- Phonetics Research
- Measure vowel formants, analyze intonation contours, annotate phonetic transcription
- Speech Pathology
- Visualize voice quality measures (HNR, spectral tilt), track therapy progress
- Language Documentation
- Transcribe endangered languages, create time-aligned corpora, work offline in the field with laptop or tablet.
- Teaching & Presentations
- Embed interactive spectrograms in lecture slides, Quarto documents, or course websites. Students can explore audio without installing software.
- Podcast & Audio Production
- Visualize speech clarity, identify noise regions, mark edit points with sub-second accuracy.
Browser Compatibility
Should work on most modern web browsers
Next Steps
- New to Ozen-web? β Start with the Getting Started Guide
- Want a guided walkthrough? β Follow the Complete Tutorial
- Looking for specific features? β Browse Feature Documentation
- Embedding in a website? β Check the Embedding Guide
- Contributing to development? β Read Development Setup
License
MIT License β free for research, teaching, and commercial use.
WASM backend options include MIT/Apache-2.0 (default) and GPL-licensed variants. See WASM Backends Reference for details.
Built with Svelte β’ SvelteKit β’ praatfan β’ TypeScript