# 🎙️ LipSyncEngine.js

> High-quality lip-sync animation from audio in the browser

[![NPM Version](https://img.shields.io/npm/v/lip-sync-engine)](https://www.npmjs.com/package/lip-sync-engine)
[![License](https://img.shields.io/npm/l/lip-sync-engine)](./LICENSE)

WebAssembly port of [Rhubarb Lip Sync](https://github.com/DanielSWolf/rhubarb-lip-sync) with TypeScript support.

## ✨ Features

- 🚀 **High Performance** - Runs in the browser via WebAssembly
- 🎯 **Accurate** - Uses PocketSphinx speech recognition for phoneme detection
- 📦 **Small Bundle** - Only ~80KB JavaScript + 2.2MB WASM + models
- 🔧 **TypeScript** - Full type definitions included
- 🌐 **Framework-Agnostic** - Works with React, Vue, Svelte, vanilla JS, and any other framework
- 🧵 **Web Workers** - Non-blocking analysis with a worker pool
- 🔄 **Streaming Support** - Real-time chunk processing for live audio
- 🎨 **Complete API** - Audio utilities, format conversion, microphone recording
- 📱 **Browser-Native** - No server required, runs entirely client-side

## 📦 Installation

```bash
npm install lip-sync-engine
```

## 🚀 Quick Start

### Vanilla JavaScript / TypeScript

```typescript
import { analyze, recordAudio } from 'lip-sync-engine';

// Record audio from microphone
const { pcm16 } = await recordAudio(5000); // 5 seconds

// Analyze
const result = await analyze(pcm16, {
  dialogText: "Hello world", // Optional, improves accuracy
  sampleRate: 16000
});

// Use mouth cues for animation
result.mouthCues.forEach(cue => {
  console.log(`${cue.start.toFixed(2)}s - ${cue.end.toFixed(2)}s: ${cue.value}`);
  // Output: 0.00s - 0.35s: X
  //         0.35s - 0.50s: D
  //         0.50s - 0.85s: B
  //         ...
});
```

### React

```tsx
import { useState, useEffect, useRef } from 'react';
import { LipSyncEngine, recordAudio } from 'lip-sync-engine';

function useLipSyncEngine() {
  const [result, setResult] = useState(null);
  const lipSyncEngineRef = useRef(LipSyncEngine.getInstance());

  // Initialize the engine on mount and release it on unmount
  useEffect(() => {
    lipSyncEngineRef.current.init();
    return () => lipSyncEngineRef.current.destroy();
  }, []);

  const analyze = async (pcm16, options) => {
    const analysis = await lipSyncEngineRef.current.analyze(pcm16, options);
    setResult(analysis);
  };

  return { analyze, result };
}

function MyComponent() {
  const { analyze, result } = useLipSyncEngine();

  const handleRecord = async () => {
    const { pcm16 } = await recordAudio(5000);
    await analyze(pcm16, { dialogText: "Hello world" });
  };

  return (
    <div>
      <button onClick={handleRecord}>Record</button>
      {result && <p>Found {result.mouthCues.length} mouth cues!</p>}
    </div>
  );
}
```

See [examples/react](./examples/react) for a complete example.

### Vue

See [examples/vue](./examples/vue) for a complete example.

### Svelte

```svelte
{#if $result}
  <p>Found {$result.mouthCues.length} mouth cues!</p>
{/if}
```

See [examples/svelte](./examples/svelte) for a complete example.

## 📚 Documentation

- [Getting Started](./docs/getting-started.md)
- [API Reference](./docs/api-reference.md)
- [Streaming Analysis Guide](./docs/streaming-analysis.md) - Real-time chunk processing
- Framework Examples:
  - [Vanilla JS](./examples/vanilla/README.md)
  - [React](./examples/react/README.md)
  - [Vue](./examples/vue/README.md)
  - [Svelte](./examples/svelte/README.md)

## 🎨 Mouth Shapes (Visemes)

LipSyncEngine.js generates 9 mouth shapes based on Preston Blair's phoneme categorization:

| Shape | Description | Example Sounds |
|-------|-------------|----------------|
| X | Lips together | p, b, m |
| C | F/V sound | f, v |
| G | Wide open | ee, ih, ey |

## 🔬 How It Works

1. **Speech Recognition** - PocketSphinx analyzes the audio to detect phonemes
2. **Phoneme Mapping** - Phonemes are mapped to Preston Blair mouth shapes
3. **Timing Optimization** - The animation is smoothed for natural transitions
4. **JSON Output** - Timestamped mouth shape cues are returned

## 📊 API Overview

### Core Functions

```typescript
// Simple one-off analysis
import { analyze } from 'lip-sync-engine';
const result = await analyze(pcm16, options);

// Async analysis (non-blocking)
import { analyzeAsync } from 'lip-sync-engine';
const asyncResult = await analyzeAsync(pcm16, options);

// Using the main class
import { LipSyncEngine } from 'lip-sync-engine';
const lipSyncEngine = LipSyncEngine.getInstance();
await lipSyncEngine.init();
const engineResult = await lipSyncEngine.analyze(pcm16, options);
```

### Streaming Analysis (Real-Time)

```typescript
import { WorkerPool } from 'lip-sync-engine';

const pool = WorkerPool.getInstance(4);
await pool.init({ /* paths */ });
await pool.warmup(); // Pre-create workers

// Create streaming analyzer
const stream = pool.createStreamAnalyzer({
  dialogText: "Expected dialog",
  sampleRate: 16000
});

// Add chunks as they arrive from WebSocket, MediaRecorder, etc.
// (audioStream is your own async iterable of audio chunks)
for await (const chunk of audioStream) {
  stream.addChunk(chunk); // Non-blocking!
}

// Get all results in order
const results = await stream.finalize();
```

See the [Streaming Analysis Guide](./docs/streaming-analysis.md) for complete usage patterns.
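
The streaming loop above expects audio to arrive in chunks. When you already hold a complete PCM16 buffer (for example from `recordAudio`) and want to try the streaming path, a small splitter can stand in for a live source. The following is a minimal sketch under assumptions: `chunkPcm16` is a hypothetical helper that is not part of lip-sync-engine, and it assumes `addChunk` accepts an `Int16Array`; check the Streaming Analysis Guide for the chunk type the library actually expects.

```typescript
// Hypothetical helper (not part of lip-sync-engine): splits a PCM16 buffer
// into fixed-duration chunks, yielding them in order.
function* chunkPcm16(
  pcm16: Int16Array,
  sampleRate: number,
  chunkMs: number
): Generator<Int16Array> {
  // Number of samples covering chunkMs milliseconds of audio
  const samplesPerChunk = Math.floor((sampleRate * chunkMs) / 1000);
  if (samplesPerChunk <= 0) {
    throw new RangeError('chunkMs is too small for the given sampleRate');
  }
  for (let offset = 0; offset < pcm16.length; offset += samplesPerChunk) {
    // slice() copies the samples, so each chunk owns its own buffer.
    // The final chunk may be shorter than samplesPerChunk.
    yield pcm16.slice(offset, offset + samplesPerChunk);
  }
}
```

With a `stream` created as in the example above, this could be used as `for (const chunk of chunkPcm16(pcm16, 16000, 500)) stream.addChunk(chunk);`. Copying with `slice()` rather than creating views with `subarray()` is a deliberate choice here: if chunks end up being handed to workers, independent buffers avoid any chunk sharing memory with its neighbours.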

### Audio Utilities

```typescript
import {
  recordAudio,
  loadAudio,
  audioBufferToInt16,
  float32ToInt16,
  resample
} from 'lip-sync-engine';

// Record from microphone (returns { pcm16, audioBuffer })
const recorded = await recordAudio(5000); // 5 seconds

// Load from file or URL (returns { pcm16, audioBuffer })
const loaded = await loadAudio('audio.mp3');

// Convert formats (float32Array is a Float32Array of samples you already have)
const fromBuffer = audioBufferToInt16(loaded.audioBuffer, 16000);
const fromFloat32 = float32ToInt16(float32Array);
const resampled = resample(float32Array, 44100, 16000);
```

### Types

```typescript
interface MouthCue {
  start: number; // seconds
  end: number;   // seconds
  value: string; // X, A, B, C, D, E, F, G, or H
}

interface LipSyncEngineResult {
  mouthCues: MouthCue[];
  metadata?: {
    duration: number;
    sampleRate: number;
    dialogText?: string;
  };
}

interface LipSyncEngineOptions {
  dialogText?: string; // Improves accuracy significantly
  sampleRate?: number; // Default: 16000 (recommended)
}
```

## 🛠️ Development

```bash
# Install dependencies
npm install

# Build WASM module
npm run build:wasm

# Build TypeScript
npm run build:ts

# Build everything
npm run build

# Type check
npm run typecheck

# Clean build artifacts
npm run clean
```

## 📄 License

MIT License - see [LICENSE](./LICENSE)

## 🙏 Credits

- Original [Rhubarb Lip Sync](https://github.com/DanielSWolf/rhubarb-lip-sync) by Daniel Wolf
- [PocketSphinx](https://github.com/cmusphinx/pocketsphinx) for speech recognition
- Preston Blair for the phoneme categorization system

## 🐛 Issues

Report issues at [https://github.com/biolimbo/lip-sync-engine/issues](https://github.com/biolimbo/lip-sync-engine/issues)