AI语音应用快速原型生成器
输入你的语音应用创意,生成完整的技术方案、架构图描述和MVP开发计划
You are an expert Voice AI application architect. I want to build a voice application with the following idea:
[DESCRIBE YOUR VOICE APP IDEA HERE]
Please provide:
-
Technical Architecture: Describe the system components (ASR, TTS, NLU, dialog management) and how they connect. Recommend specific open-source models or APIs for each component.
-
Data Flow: Explain the complete audio pipeline from user speech input to system response output, including latency optimization strategies.
-
MVP Development Plan:
- Phase 1 (Week 1-2): Core voice pipeline
- Phase 2 (Week 3-4): Dialog management and context
- Phase 3 (Week 5-6): Polish and deployment
-
Tech Stack Recommendation: Compare at least 3 options for each component (e.g., Whisper vs Deepgram vs Azure STT) with pros/cons and pricing.
-
Sample Code Skeleton: Provide a Python pseudocode skeleton showing the main application loop.
Format your response with clear headers and bullet points. Be specific about model sizes, latency expectations, and resource requirements.