TG Voice Whisper Transcriber技能使用说明

2026-03-29 新闻来源：网淘吧围观:150

电脑广告

手机广告

名称：tg-voice-whisper 描述：使用本地OpenAI Whisper（tiny模型）自动转录Telegram语音消息（.ogg Opus格式）。回复时附带转录文本，并自动删除文件以保护隐私。无需API密钥，安装后完全离线运行。表情符号：🎙️🔊

要求：二进制文件：[whisper, ffmpeg]

TG Voice Whisper Transcriber

安装：

类型：apt 软件包：[ffmpeg]
类型：pip 软件包：[openai-whisper] 标志：[--break-system-packages]

最低服务器要求：建议4GB RAM以确保快速稳定运行（模型缓存后，在1vCPU上转录时间<1秒）。

安全性：仅限本地运行，自动删除文件，不存储外部API或凭据。

使用方式（代理）：当语音.ogg文件到达/root/.openclaw/media/inbound/目录时：

exec 'whisper PATH --model tiny --language ru --output_format txt --output_dir /tmp/whisper; text=$(cat /tmp/whisper/*.txt); message action=send message="🔊 [text]" reply_to_current; rm PATH /tmp/whisper/*'

自动设置（非阻塞）：每5秒生成子代理或cron任务：

sessions_spawn task="LOOP: find /root/.openclaw/media/inbound/*.ogg -mmin -1 → background whisper ru tiny → message reply text → rm" label="voice-auto" cleanup="keep"

或使用cron：

cron add job={"name":"voice-auto","schedule":{"kind":"every","everyMs":5000},"payload":{"kind":"systemEvent","text":"🔊 VOICE_CHECK"},"sessionTarget":"main"}

测试：whisper /路径.ogg --model tiny --language ru

注意事项：

首次运行：约15秒下载模型（72MB ~/.cache/whisper/tiny.bin）。
缓存后：在1vCPU/4GB配置下<1秒完成转录。
语言：俄语/英语最佳；--语言检测自动。
准确率：俄语语音识别tiny模型约为85-95%；可升级至base或small模型以获得更好效果。

免责申明

部分文章来自各大搜索引擎，如有侵权，请与我联系删除。

打赏

文章底部电脑广告

手机广告位-内容正文底部

标签

上一篇：Web Form Automation技能使用说明下一篇：小红书自动发布助手

TG Voice Whisper Transcriber技能使用说明

相关文章

推荐文章

热门浏览

标签列表