AkkoYK cbe94b84fc feat: 为 FishAudio TTS 添加可选的 reference_id 直接指定功能 (#2513)
* 移除TTS提供商:FishAudio TTS的角色名称查询机制,改为直接使用参考模型ID

/// 修改内容 ///
- 移除复杂的角色查询逻辑
删除了 get_reference_id_by_character 方法
移除了通过角色名称搜索模型ID的API调用逻辑
- 简化配置字段
将 fishaudio-tts-character 字段替换为 fishaudio-tts-reference-id
设置默认值为可莉的模型ID:626bb6d3f3364c9cbc3aa6a67300a664
- 优化代码结构
直接在初始化时获取reference_id
简化请求生成逻辑,直接使用配置的模型ID

/// 修改原因 ///
避免同名冲突:不同模型可能使用相同的角色名称,导致获取错误的模型
提高性能:移除了额外的API查询步骤,减少延迟
增强可靠性:用户直接指定准确的模型ID,避免搜索失败的情况
简化维护:减少了代码复杂度,降低维护成本

/// 新的使用方式 ///
用户需要从 FishAudio 模型的详情页面/URL 中获取具体的模型ID(如 626bb6d3f3364c9cbc3aa6a67300a664),并在配置中直接填入 fishaudio-tts-reference-id 字段。

这个修改使得FishAudio TTS的配置更加直观和可靠,同时提升了系统的整体性能。

* Refactor: 添加FishAudio TTS reference_id格式验证

添加ID格式验证逻辑,防止无效的reference_id调用API失败。
验证32位十六进制格式并提供详细错误提示。

* Feat: 添加FishAudio TTS可选reference_id配置实现向前兼容

新增可选的reference_id字段,优先使用直接ID,未配置时回退到角色名称查询。
保持完全向前兼容,现有配置无需修改。
2025-08-22 16:55:07 +08:00
2025-08-19 01:32:24 +08:00
2025-08-19 01:32:24 +08:00
2025-03-03 13:30:55 +08:00
2025-05-10 10:02:18 +08:00
2025-03-24 17:08:12 +00:00
2025-05-01 20:32:05 +08:00
2023-04-15 15:33:00 +08:00
2025-03-25 13:24:37 +08:00
2025-02-24 00:31:57 +08:00
2025-03-27 21:32:38 +08:00
2025-08-19 01:32:24 +08:00
2025-08-20 16:41:22 +08:00

6e1279651f16d7fdf4727558b72bbaf1

Easy-to-use Multi-platform LLM Chatbot & Development Framework

Soulter%2FAstrBot | Trendshift

GitHub release (latest by date) python Docker pull Static Badge wakatime Dynamic JSON Badge codecov

Documentation Issue Tracking

AstrBot is a loosely coupled, asynchronous chatbot and development framework that supports multi-platform deployment, featuring an easy-to-use plugin system and comprehensive Large Language Model (LLM) integration capabilities.

Key Features

  1. LLM Conversations - Supports various LLMs including OpenAI API, Google Gemini, Llama, Deepseek, ChatGLM, etc. Enables local model deployment via Ollama/LLMTuner. Features multi-turn dialogues, personality contexts, multimodal capabilities (image understanding), and speech-to-text (Whisper).
  2. Multi-platform Integration - Supports QQ (OneBot), QQ Channels, WeChat (Gewechat), Feishu, and Telegram. Planned support for DingTalk, Discord, WhatsApp, and Xiaomi Smart Speakers. Includes rate limiting, whitelisting, keyword filtering, and Baidu content moderation.
  3. Agent Capabilities - Native support for code execution, natural language TODO lists, web search. Integrates with Dify Platform for easy access to Dify assistants/knowledge bases/workflows.
  4. Plugin System - Optimized plugin mechanism with minimal development effort. Supports multiple installed plugins.
  5. Web Dashboard - Visual configuration management, plugin controls, logging, and WebChat interface for direct LLM interaction.
  6. High Stability & Modularity - Event bus and pipeline architecture ensures high modularization and loose coupling.

Tip

Dashboard Demo: https://demo.astrbot.app/
Username: astrbot, Password: astrbot (LLM not configured for chat page)

Deployment

Docker Deployment

See docs: Deploy with Docker

Windows Installer

Requires Python (>3.10). See docs: Windows Installer Guide

Replit Deployment

Run on Repl.it

CasaOS Deployment

Community-contributed method.
See docs: CasaOS Deployment

Manual Deployment

See docs: Source Code Deployment

Platform Support

Platform Status Details Message Types
QQ (Official Bot) Private/Group chats Text, Images
QQ (OneBot) Private/Group chats Text, Images, Voice
WeChat (Personal) Private/Group chats Text, Images, Voice
Telegram Private/Group chats Text, Images
WeChat Work Private chats Text, Images, Voice
Feishu Group chats Text, Images
WeChat Open Platform 🚧 Planned -
Discord 🚧 Planned -
WhatsApp 🚧 Planned -
Xiaomi Speakers 🚧 Planned -

Provider Support Status

Name Support Type Notes
OpenAI API Text Generation Supports all OpenAI API-compatible services including DeepSeek, Google Gemini, GLM, Moonshot, Alibaba Cloud Bailian, Silicon Flow, xAI, etc.
Claude API Text Generation
Google Gemini API Text Generation
Dify LLMOps
DashScope (Alibaba Cloud) LLMOps
Ollama Model Loader Local deployment for open-source LLMs (DeepSeek, Llama, etc.)
LM Studio Model Loader Local deployment for open-source LLMs (DeepSeek, Llama, etc.)
LLMTuner Model Loader Local loading of fine-tuned models (e.g. LoRA)
OneAPI LLM Distribution
Whisper Speech-to-Text Supports API and local deployment
SenseVoice Speech-to-Text Local deployment
OpenAI TTS API Text-to-Speech
Fishaudio Text-to-Speech Project involving GPT-Sovits author

🦌 Roadmap

Tip

Suggestions welcome via Issues <3

  • Ensure feature parity across all platform adapters
  • Optimize plugin APIs
  • Add default TTS services (e.g., GPT-Sovits)
  • Enhance chat features with persistent memory
  • i18n Planning

❤️ Contributions

All Issues/PRs welcome! Simply submit your changes to this project :)

For major features, please discuss via Issues first.

🌟 Support

Demos

Note

Code executor file I/O currently tested with Napcat(QQ)/Lagrange(QQ)

Docker-based Sandboxed Code Executor (Beta)

Multimodal Input, Web Search, Text-to-Image

Natural Language TODO Lists

Plugin System Showcase

Web Dashboard

webchat

Built-in Web Chat Interface

Star History

Tip

If this project helps you, please give it a star <3

Star History Chart

Disclaimer

  1. Licensed under AGPL-v3.
  2. WeChat integration uses Gewechat. Use at your own risk with non-critical accounts.
  3. Users must comply with local laws and regulations.

私は、高性能ですから!

S
Description
No description provided
Readme AGPL-3.0 65 MiB
Languages
Python 69.1%
Vue 25.4%
TypeScript 3.4%
JavaScript 1.4%
Shell 0.3%
Other 0.4%