transcribe-voice-notes
Download WhatsApp voice notes, transcribe them, and correct common mining site mis-transcriptions (max 300 chars)
When & Why to Use This Skill
This Claude skill automates the end-to-end workflow of retrieving, transcribing, and refining WhatsApp voice notes. It is specifically optimized for technical environments, featuring a built-in correction engine that fixes common AI transcription errors related to mining site terminology, ensuring that verbal field updates are converted into high-accuracy professional documentation.
Use Cases
- Field Reporting: Mining engineers and site supervisors can record updates via WhatsApp and have them automatically converted into accurate text reports without manual typing.
- Task Management: Convert verbal instructions from team members into structured, searchable tasks while ensuring technical site names like 'Nchwaning 3' are correctly identified.
- Technical Documentation: Maintain a reliable knowledge base of on-site inspections by transcribing audio memos with automatic correction of industry-specific jargon and equipment locations.
- Operational Efficiency: Reduce communication errors in industrial settings by automatically fixing common phonetic mis-transcriptions of site-specific names and technical terms.
| name | Transcribe Voice Notes |
|---|---|
| description | Download WhatsApp voice notes, transcribe them, and correct common mining site mis-transcriptions (max 300 chars) |
Transcribe Voice Notes
Transcribe WhatsApp voice notes with automatic correction of commonly mis-transcribed mining terminology.
When to Use This Skill
- You have WhatsApp voice notes to process
- You want automatic transcription with mining-site name corrections
- You need structured task creation from voice note content
Usage
Via slash command:
/transcribe-voice-notes
Via Python (if running independently):
python transcribe_voice_notes.py --contact "27833911315" --limit 5
Prerequisites
- WhatsApp MCP server running and authenticated
gpt-4o-mini-transcribemodel accessmedia/audio/folder exists in vault
Data Extraction
The skill performs these steps:
- Download: Gets all audio messages from specified WhatsApp contact
- Copy: Moves audio files to
media/audio/for processing - Transcribe: Uses
gpt-4o-mini-transcribeto convert audio to text - Correct: Applies mining site name corrections from built-in dictionary
- Output: Returns corrected transcriptions ready for task creation
Common Mis-Transcriptions (Corrections Applied)
| Mis-Transcribed | Actual Term | Notes |
|---|---|---|
| Trinity | Nchwaning 3 / N3 | Most common - sounds like "Trinity" |
| Hutt | Haut | Plant/equipment location |
| Ubuntu | Update | When discussing software/systems |
| N2/NCH2 | Nchwaning 2 | Sometimes transcribed as different letters |
| Glory/Gloria | Gloria | Mine site name (usually correct) |
Example Outputs
Input: Voice note: "This is an overview of the Trinity underground plant conveyors."
Processed: "This is an overview of the Nchwaning 3 underground plant conveyors."
Input: Voice note: "Why only ten faces blasted?"
Processed: "Why only ten faces blasted?" (No corrections needed)
Output Format
Returns array of corrected transcriptions:
{
"transcriptions": [
{
"timestamp": "2025-11-26 06:49",
"original": "This is an overview of the Trinity underground plant conveyors.",
"corrected": "This is an overview of the Nchwaning 3 underground plant conveyors.",
"corrections_applied": ["Trinity → Nchwaning 3"],
"file_name": "voice_20251126_065442.ogg"
}
]
}
Validation Checks
- ✓ All audio files successfully downloaded
- ✓ All files copied to
media/audio/ - ✓ All transcriptions generated
- ✓ Common corrections applied
- ✓ Output formatted for task creation
Common Issues & Troubleshooting
Issue: "File not found" on transcription
- Solution: Ensure file was copied to
media/audio/before transcribing
Issue: Correction not applied
- Solution: Check if term is in corrections dictionary; add new terms as needed
Issue: Wrong contact selected
- Solution: Verify phone number format (use 27XXXXXXXXX format without +)
Next Steps After Transcription
- Review corrected transcriptions
- Create tasks from voice note content
- Add to task list with appropriate tags (#site, #priority)
- Link to relevant projects if applicable
Advanced: Adding New Corrections
Edit the corrections_dict in transcribe_voice_notes.py:
corrections_dict = {
'trinity': 'Nchwaning 3',
'hutt': 'Haut',
'ubuntu': 'Update',
# Add new corrections here
}
Then restart the skill for changes to take effect.