nexus/wiki/concepts/TextToSpeech.md

---
title: "TextToSpeech"
type: concept
tags: [ai, speech, text-to-speech]
sources: [我的工具集]
last_updated: 2026-05-11
---

## Definition
Text-to-Speech（TTS）是将文本转换为语音的 AI 技术，也称为语音合成。

## Key Characteristics
- 将书面文本转换为可听的语音输出
- 广泛应用于辅助阅读、语音导航、无障碍访问等场景
- 现代 TTS 系统基于深度学习（如 WaveNet、Tacotron）生成自然语音

## Examples from Sources
- [[Google AI Studio]] 提供免费的 Text-to-Speech 服务，支持 Gemini 模型和 Dialog 对话

## Relationships
- 属于 [[AI时代发展策略]] 的创意工具层
- 与 [[TextToVideo]] 互补：TTS 处理音频，TextToVideo 处理视频