Quick Start Guide
1. Set Up Audio Devices
# List available audio devices
./run_transcribe.sh --list-devices
Find your:
- Microphone - Your input device (e.g., "USB Microphone")
- Monitor - Speaker capture device (e.g., "Monitor of Built-in Audio")
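If the device list is long, a small filter can narrow it down to monitor devices. This is only a sketch: `find_monitor` is a hypothetical helper, and it assumes `--list-devices` prints one device per line, which this guide does not confirm.

```shell
# Hypothetical helper: keep only lines mentioning "monitor" (case-insensitive).
find_monitor() {
  grep -i 'monitor'
}

# Assumed usage (one device per line is an assumption about the output format):
#   ./run_transcribe.sh --list-devices | find_monitor

# Demonstration on sample device names taken from the examples above.
printf '%s\n' 'USB Microphone' 'Monitor of Built-in Audio' | find_monitor
```

The name it prints is the kind of value you would pass to `--monitor`.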
2. Basic Usage
Simple Transcription
# Auto-detect devices
./run_transcribe.sh --model medium --language en
# Specify devices
./run_transcribe.sh --mic "USB Mic" --monitor "Monitor"
With File Output
./run_transcribe.sh --model medium --language en --output transcript.txt
With LLM Analysis
./run_transcribe.sh --model medium --enable-llm --output enriched.txt
3. Language Examples
Dutch (Nederlands)
./run_transcribe.sh --model medium --language nl --enable-llm
Spanish
./run_transcribe.sh --model medium --language es
French
./run_transcribe.sh --model medium --language fr
German
./run_transcribe.sh --model medium --language de
4. Model Selection
| Model | Speed | Quality | Command |
|---|---|---|---|
| tiny | Fastest | Basic | --model tiny |
| base | Fast | Good | --model base |
| small | Moderate | Better | --model small |
| medium | Slow | Great | --model medium (recommended) |
| large | Slowest | Best | --model large |
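The table can be turned into a small lookup when scripting launches. The sketch below is illustrative only: `pick_model` and its preference names (`fastest`, `fast`, `balanced`, `quality`, `best`) are hypothetical and not part of `run_transcribe.sh`.

```shell
# Hypothetical helper: map a rough preference to a --model value from the table.
pick_model() {
  case "$1" in
    fastest)  echo tiny ;;
    fast)     echo base ;;
    balanced) echo small ;;
    quality)  echo medium ;;
    best)     echo large ;;
    *)        echo medium ;;  # fall back to the guide's recommended model
  esac
}

# Example: print the command that would be run for a "fast" preference.
echo "./run_transcribe.sh --model $(pick_model fast) --language en"
```

Unknown preferences fall back to `medium`, matching the recommendation in the table.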
5. Optimization Tips
High Quality Transcription
./run_transcribe.sh --model large --interval 8 --min-duration 4
Fast Real-Time
./run_transcribe.sh --model tiny --interval 3 --min-duration 2
Best Dutch Transcription (Your Setup)
./run_transcribe.sh --model medium --interval 8 --min-duration 4 --enable-llm --language nl
6. LLM Configuration
Default Model (qwen2.5:3b - Fast)
./run_transcribe.sh --enable-llm
Larger Model (Better Analysis)
# Install model first
ollama pull llama3.2
# Use it
./run_transcribe.sh --enable-llm --llm-model llama3.2
7. Output Examples
Console Output
🎤 [14:23:15] User speaking via microphone
🔊 [14:23:20] Audio from speakers
🎤 [14:23:25] The Earth orbits the Sun in 365 days.
✅ FACTUAL (0.98): Scientifically accurate.
❓ Questions:
1. Why do we need leap years?
2. How does orbital speed vary?
3. What affects Earth's orbit?
File Output
The transcript is saved to transcript.txt, or to the file you specify with --output, with timestamps and analysis.
8. Controls
- Ctrl+C - Stop transcription
- Processing happens every --interval seconds (default: 5s)
- Minimum --min-duration audio required (default: 2s)
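The two timing flags interact, so it can help to sanity-check them before launching. The sketch below is a hypothetical wrapper, not part of the tool. It assumes whole-second values and assumes that `--min-duration` should not exceed `--interval`. That rule is inferred from the examples in this guide and is not a documented constraint.

```shell
# Hypothetical helper: validate timing flags and print them for reuse.
# Assumption (not confirmed by the guide): --min-duration should not exceed --interval.
check_timing() {
  interval="${1:-5}"       # guide default: 5s
  min_duration="${2:-2}"   # guide default: 2s
  if [ "$min_duration" -gt "$interval" ]; then
    echo "warning: --min-duration ($min_duration) exceeds --interval ($interval)"
    return 1
  fi
  echo "--interval $interval --min-duration $min_duration"
}

# Example: build the flags for the high-quality settings shown earlier.
flags=$(check_timing 8 4) && echo "./run_transcribe.sh --model large $flags"
```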
Troubleshooting
No devices found:
./run_transcribe.sh --list-devices
Ollama errors:
ollama serve
ollama pull qwen2.5:3b
Force CPU (GPU issues):
./run_transcribe.sh --force-cpu
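Preflight check (optional):
Before using --enable-llm, a short check can confirm that the `ollama` CLI is on the PATH and that the model appears in `ollama list`. This is a sketch under assumptions: `have_cmd` and `preflight` are hypothetical helper names, and the check assumes the model name appears in the first column of the `ollama list` output.

```shell
# Hypothetical helper: succeed if a command is available on PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Hypothetical preflight: check the ollama CLI and the requested model.
preflight() {
  model="${1:-qwen2.5:3b}"   # default model named in this guide
  if ! have_cmd ollama; then
    echo "ollama not found on PATH"
    return 1
  fi
  # Skip the header row, then look for an exact match in the first column.
  if ! ollama list 2>/dev/null | awk -v m="$model" 'NR > 1 && $1 == m { found = 1 } END { exit !found }'; then
    echo "model $model not listed (is 'ollama serve' running, and was the model pulled?)"
    return 2
  fi
  echo "ok"
}
```

Run `preflight` for the default model, or `preflight llama3.2:latest` for another one. The tag suffix here is an assumption about how the model is listed.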