Files
verbatim-dicta/QUICK_START.md
2025-12-17 22:30:41 +01:00

3.0 KiB

Quick Start Guide

1. Setup Audio Devices

# List available audio devices
./run_transcribe.sh --list-devices

Find your:

  • Microphone - Your input device (e.g., "USB Microphone")
  • Monitor - Speaker capture device (e.g., "Monitor of Built-in Audio")

2. Basic Usage

Simple Transcription

# Auto-detect devices
./run_transcribe.sh --model medium --language en

# Specify devices
./run_transcribe.sh --mic "USB Mic" --monitor "Monitor"

With File Output

./run_transcribe.sh --model medium --language en --output transcript.txt

With LLM Analysis

./run_transcribe.sh --model medium --enable-llm --output enriched.txt

3. Language Examples

Dutch (Nederlands)

./run_transcribe.sh --model medium --language nl --enable-llm

Spanish

./run_transcribe.sh --model medium --language es

French

./run_transcribe.sh --model medium --language fr

German

./run_transcribe.sh --model medium --language de

4. Model Selection

Model Speed Quality Command
tiny Fastest Basic --model tiny
base Fast Good --model base
small Moderate Better --model small
medium Slow Great --model medium (recommended)
large Slowest Best --model large

5. Optimization Tips

High Quality Transcription

./run_transcribe.sh --model large --interval 8 --min-duration 4

Fast Real-Time

./run_transcribe.sh --model tiny --interval 3 --min-duration 2

Best Dutch Transcription (Your Setup)

./run_transcribe.sh --model medium --interval 8 --min-duration 4 --enable-llm --language nl

6. LLM Configuration

Default Model (qwen2.5:3b - Fast)

./run_transcribe.sh --enable-llm

Larger Model (Better Analysis)

# Install model first
ollama pull llama3.2

# Use it
./run_transcribe.sh --enable-llm --llm-model llama3.2

7. Output Examples

Console Output

🎤 [14:23:15] User speaking via microphone
🔊 [14:23:20] Audio from speakers

🎤 [14:23:25] The Earth orbits the Sun in 365 days.
   ✅ FACTUAL (0.98): Scientifically accurate.
   ❓ Questions:
      1. Why do we need leap years?
      2. How does orbital speed vary?
      3. What affects Earth's orbit?

File Output

Saved to transcript.txt or your specified file with timestamps and analysis.


8. Controls

  • Ctrl+C - Stop transcription
  • Processing happens every --interval seconds (default: 5s)
  • Minimum --min-duration audio required (default: 2s)

Troubleshooting

No devices found:

./run_transcribe.sh --list-devices

Ollama errors:

ollama serve
ollama pull qwen2.5:3b

Force CPU (GPU issues):

./run_transcribe.sh --force-cpu