BetterWhisperX By AiBard123 January 7, 2025 - 2 min read BetterWhisperX is an improved version of WhisperX that supports fast multilingual automatic speech recognition and speaker diarization.
AutoGen Book Generator By AiBard123 January 7, 2025 - 2 min read AutoGen Book Generator is a Python-based system in which multiple AI agents collaborate to produce complete, structured books.
Structured Outputs Sample Apps By AiBard123 January 7, 2025 - 2 min read The Structured Outputs sample apps show how to build reliable NextJS applications using the structured outputs feature of the OpenAI API.
VITA-1.5 By AiBard123 January 7, 2025 - 2 min read VITA-1.5 is a powerful open-source interactive multimodal large language model that supports real-time vision and speech interaction.
YouTube Summary Extension By AiBard123 January 6, 2025 - 2 min read YouTube Summary Extension is a Chrome extension that uses AI to generate concise summaries of YouTube videos and supports multiple AI providers.
open-pi-zero By AiBard123 January 6, 2025 - 2 min read open-pi-zero is an implementation based on Physical Intelligence's pi0 model, using an MoE architecture and a pretrained 3B PaliGemma VLM.