chore: update 6 file(s)

This commit is contained in:
mike
2025-12-17 22:30:41 +01:00
parent a53c0e2902
commit 4343b7a5a2
6 changed files with 1122 additions and 220 deletions

View File

@@ -1,105 +1,156 @@
# Quick Start Guide
## Dutch Language (Nederlands)
## 1. Setup Audio Devices
### Basic Dutch Transcription
```bash
./RUN_DUTCH.sh
# List available audio devices
./run_transcribe.sh --list-devices
```
- ✅ GPU-accelerated (RTX 4060 Ti)
- ✅ Sentence extraction (complete zinnen)
- ✅ Base model (goede balans snelheid/nauwkeurigheid)
### Dutch with LLM Analysis
Find your:
- **Microphone** - Your input device (e.g., "USB Microphone")
- **Monitor** - Speaker capture device (e.g., "Monitor of Built-in Audio")
---
## 2. Basic Usage
### Simple Transcription
```bash
./RUN_DUTCH_LLM.sh
# Auto-detect devices
./run_transcribe.sh --model medium --language en
# Specify devices
./run_transcribe.sh --mic "USB Mic" --monitor "Monitor"
```
- ✅ All features from basic version
- ✅ Fact-checking van uitspraken
- ✅ Automatische vraag generatie
- Uses llama3.2:latest model
### Save to File
### With File Output
```bash
./RUN_DUTCH.sh --output transcript.txt
./RUN_DUTCH_LLM.sh --output enriched.txt
./run_transcribe.sh --model medium --language en --output transcript.txt
```
### With LLM Analysis
```bash
./run_transcribe.sh --model medium --enable-llm --output enriched.txt
```
---
## English Language
## 3. Language Examples
### Basic English Transcription
### Dutch (Nederlands)
```bash
./RUN_GPU.sh
./run_transcribe.sh --model medium --language nl --enable-llm
```
### English with LLM
```bash
./RUN_GPU.sh --enable-llm
```
---
## Other Languages
### Spanish
```bash
./RUN_GPU.sh --language es
./run_transcribe.sh --model medium --language es
```
### French
```bash
./RUN_GPU.sh --language fr
./run_transcribe.sh --model medium --language fr
```
### German
```bash
./RUN_GPU.sh --language de
./run_transcribe.sh --model medium --language de
```
---
## Available Ollama Models
## 4. Model Selection
You have these models installed:
- `llama3.2:latest` (2.0 GB) - **Default** - Fast and accurate
- `llama3:8b` (4.7 GB) - More powerful
- `qwen2.5:3b` (1.9 GB) - Fast alternative
- `qwen2.5:7b` (4.7 GB) - Powerful alternative
- `qwen2.5:0.5b` (397 MB) - Very fast, less accurate
| Model | Speed | Quality | Command |
|--------|----------|---------|----------------------------------|
| tiny | Fastest | Basic | `--model tiny` |
| base | Fast | Good | `--model base` |
| small | Moderate | Better | `--model small` |
| medium | Slow | Great | `--model medium` **(recommended)** |
| large | Slowest | Best | `--model large` |
To use a different model:
---
## 5. Optimization Tips
### High Quality Transcription
```bash
./RUN_DUTCH_LLM.sh --llm-model "llama3:8b"
./run_transcribe.sh --model large --interval 8 --min-duration 4
```
### Fast Real-Time
```bash
./run_transcribe.sh --model tiny --interval 3 --min-duration 2
```
### Best Dutch Transcription (Your Setup)
```bash
./run_transcribe.sh --model medium --interval 8 --min-duration 4 --enable-llm --language nl
```
---
## Tips
## 6. LLM Configuration
### Better Accuracy
Use larger Whisper model (slower):
### Default Model (qwen2.5:3b - Fast)
```bash
./RUN_DUTCH.sh --model medium # or: large
./run_transcribe.sh --enable-llm
```
### Faster Processing
Use smaller model or reduce interval:
### Larger Model (Better Analysis)
```bash
./RUN_DUTCH.sh --model tiny --interval 3
```
# Install model first
ollama pull llama3.2
### Debug LLM Issues
```bash
./RUN_DUTCH_LLM.sh --llm-debug
# Use it
./run_transcribe.sh --enable-llm --llm-model llama3.2
```
---
## Controls
## 7. Output Examples
- **Ctrl+C** to stop transcription
- Speak clearly into your microphone
- Wait ~5 seconds for transcription to appear
- Sentences appear with 📝 emoji
### Console Output
```
🎤 [14:23:15] User speaking via microphone
🔊 [14:23:20] Audio from speakers
🎤 [14:23:25] The Earth orbits the Sun in 365 days.
✅ FACTUAL (0.98): Scientifically accurate.
❓ Questions:
1. Why do we need leap years?
2. How does orbital speed vary?
3. What affects Earth's orbit?
```
### File Output
Saved to `transcript.txt` or your specified file with timestamps and analysis.
---
## 8. Controls
- **Ctrl+C** - Stop transcription
- Processing happens every `--interval` seconds (default: 5s)
- Minimum `--min-duration` audio required (default: 2s)
---
## Troubleshooting
**No devices found:**
```bash
./run_transcribe.sh --list-devices
```
**Ollama errors:**
```bash
ollama serve
ollama pull qwen2.5:3b
```
**Force CPU (GPU issues):**
```bash
./run_transcribe.sh --force-cpu
```