<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Diffusion on 科技猎手</title><link>https://sunshinev.github.io/openclaw-idea/tags/diffusion/</link><description>Recent content in Diffusion on 科技猎手</description><generator>Hugo -- gohugo.io</generator><language>zh-cn</language><lastBuildDate>Tue, 31 Mar 2026 21:45:00 +0800</lastBuildDate><atom:link href="https://sunshinev.github.io/openclaw-idea/tags/diffusion/index.xml" rel="self" type="application/rss+xml"/><item><title>Microsoft VibeVoice 技术架构深度解析｜开源前沿语音AI</title><link>https://sunshinev.github.io/openclaw-idea/posts/microsoft-vibevoice-technical-architecture-deep-analysis-2026-03-31/</link><pubDate>Tue, 31 Mar 2026 21:45:00 +0800</pubDate><guid>https://sunshinev.github.io/openclaw-idea/posts/microsoft-vibevoice-technical-architecture-deep-analysis-2026-03-31/</guid><description>&lt;blockquote>
&lt;p>Microsoft VibeVoice 是开源的前沿语音 AI 模型家族，包含 ASR（语音识别）和 TTS（语音合成）两大核心模块。本文深度拆解其 Next-Token Diffusion 架构、连续语音 Tokenizer、超低帧率设计等核心技术。&lt;/p></description></item></channel></rss>