Intent Understanding & Parameters
How Wubble identifies your goals and extracts precise technical parameters from natural language
Overview
Intent Understanding and Parameter Extraction is the sophisticated AI system that bridges the gap between how you naturally communicate and the precise technical specifications needed for audio production. When you say "create an upbeat electronic track," Wubble doesn't just recognize words—it understands your goal (music generation), identifies relevant parameters (genre, mood, energy), and infers missing details based on context and best practices.
This system handles ambiguity, asks clarifying questions when needed, applies intelligent defaults, and decomposes complex multi-step requests into executable workflows—all automatically, behind the scenes.
Core Capabilities
Intent Recognition
Intent recognition identifies what you want to accomplish. Wubble recognizes dozens of core intents across all audio production domains:
Generation Intents
- • Generate music
- • Generate voice/speech
- • Generate sound effects
- • Create variations
- • Extend/continue audio
Modification Intents
- • Adjust parameters
- • Apply effects
- • Change style/mood
- • Transform characteristics
- • Edit/trim audio
Processing Intents
- • Separate stems
- • Mix tracks
- • Master audio
- • Clean/enhance quality
- • Normalize/optimize
Analysis Intents
- • Analyze characteristics
- • Identify issues
- • Compare versions
- • Extract metadata
- • Suggest improvements
Management Intents
- • Retrieve asset
- • Save/export
- • Organize files
- • Version control
- • Share/collaborate
Query Intents
- • Ask questions
- • Request explanations
- • Get suggestions
- • Learn techniques
- • Understand concepts
Parameter Extraction
Once intent is identified, Wubble extracts all relevant parameters from your message:
Natural Language → Intent + Parameters
Input: "Create an upbeat indie rock track for a product commercial"
Extracted Intent:
Primary: GENERATE_MUSIC
Context: Commercial production
Extracted Parameters:
genre: "indie_rock"
mood: "upbeat"
energy: "high"
useCase: "commercial"
Inferred Parameters (from context):
duration: 30 seconds (typical for commercials)
tempo: 120-140 BPM (typical for upbeat indie rock)
structure: "intro-hook-outro" (commercial format)
mixingStyle: "broadcast-ready"
Smart Defaults Applied:
sampleRate: 48000 Hz (commercial standard)
bitDepth: 24 bit
format: "WAV"Parameter Categories
Parameter Categories:
1. Creative Parameters (Subjective)
• Mood: happy, sad, energetic, calm, tense
• Style: cinematic, minimal, organic, electronic
• Character: warm, bright, dark, airy, heavy
2. Technical Parameters (Objective)
• Tempo: BPM values
• Key: Musical keys and modes
• Duration: Time in seconds/minutes
• Sample Rate: 44.1k, 48k, 96k Hz
3. Structural Parameters
• Arrangement: intro, verse, chorus, bridge
• Sections: build-ups, breakdowns, transitions
• Layering: number and types of elements
4. Processing Parameters
• EQ: Frequency adjustments
• Compression: Dynamic control
• Reverb: Spatial characteristics
• Effects: Specific processing chains
5. Contextual Parameters
• Use case: commercial, film, game, podcast
• Platform: Spotify, YouTube, Instagram
• Audience: age, demographics, preferences
• Brand: guidelines and requirementsCreative to Technical Translation
Wubble translates creative descriptions into technical parameters: "warm and cozy" becomes specific EQ curves, harmonic content, and reverb characteristics. "Energetic" maps to tempo ranges, rhythmic density, and frequency brightness.
Handling Ambiguity
When requests are ambiguous or lack critical information, Wubble employs several strategies to resolve uncertainty:
Handling Ambiguous Requests:
Ambiguous Input: "Make it better"
AI Analysis:
❓ What aspect needs improvement?
- Mix balance?
- Energy level?
- Sound quality?
- Arrangement?
Wubble Response:
"I'd be happy to improve it! To make the most effective changes,
could you tell me what specifically you'd like improved?
• Mix/Balance - Are levels or frequencies off?
• Energy - Should it be more/less energetic?
• Sound Quality - Technical issues to address?
• Arrangement - Structural changes needed?
• Something else?
Or if you'd like, I can analyze the track and suggest improvements."
Result: Converts vague request into specific, actionable direction.Resolution Strategies
Clarifying Questions
Ask targeted questions to gather missing information. Questions are specific, multiple-choice when possible, and prioritized by importance.
Contextual Inference
Use conversation history, project context, and user preferences to infer likely intent when explicit information is missing.
Smart Defaults
Apply industry-standard defaults based on the type of content being created. Commercial music gets different defaults than podcast narration.
Multiple Options Approach
When truly ambiguous, generate multiple interpretations or variations and let you choose, turning ambiguity into creative exploration.
Confidence Thresholds
Only proceed with uncertain interpretations if confidence is above a threshold, otherwise ask for clarification to avoid mistakes.
Complex Multi-Intent Requests
Wubble can handle requests that contain multiple intents and automatically decompose them into properly sequenced workflows:
Complex Multi-Intent Request:
Input: "Take the track we made earlier, separate the stems, boost the
bass, add more reverb to the vocals, and then master it for Spotify"
Decomposed Intents:
1. RETRIEVE_ASSET
└─ Parameter: "the track we made earlier"
└─ Context: Recent conversation history
2. STEM_SEPARATION
└─ Target: Retrieved track
3. ADJUST_FREQUENCY
└─ Target: Bass stem
└─ Parameter: Boost (increase)
4. APPLY_EFFECT
└─ Effect: Reverb
└─ Target: Vocal stem
└─ Parameter: Increase amount
5. MIX_STEMS
└─ Inputs: All adjusted stems
6. MASTER_AUDIO
└─ Platform: Spotify
└─ Target: -14 LUFS
Execution Order:
Retrieve → Separate → Adjust Bass & Vocal Reverb (parallel)
→ Mix → Master
Wubble handles decomposition and orchestration automatically.Automatic Orchestration
Wubble determines the correct execution order, identifies dependencies, and parallelizes operations when possible for optimal efficiency.
Pipeline Management
Each step's output becomes the next step's input automatically. Intermediate results are managed transparently, with option to inspect any stage.
Confidence & Transparency
Wubble provides confidence scores for its interpretations and explains its reasoning:
High Confidence (>0.9)
Wubble proceeds automatically with extracted parameters. Intent is clear and unambiguous.
Example: "Generate 128 BPM house track in Aminor"
Medium Confidence (0.7-0.9)
Wubble proceeds but mentions assumptions: "I'm creating this with X assumption. Let me know if you want different."
Example: "Make it more energetic" (unclear how much)
Low Confidence (<0.7)
Wubble asks clarifying questions before proceeding to avoid mistakes and wasted effort.
Example: "Make it better" (completely ambiguous)
Best Practices
Combine Creative and Technical
Mix descriptive language with specific parameters: "Warm lo-fi hip hop beat at 85 BPM" provides both creative direction and technical precision.
Provide Context
Mention what you're creating and its purpose. Context helps Wubble apply appropriate defaults and make better inferences about missing parameters.
Answer Clarifying Questions
When Wubble asks for clarification, provide specific answers. This improves future understanding as the AI learns your preferences.
Trust the Inference System
You don't need to specify every parameter. Let Wubble fill in sensible defaults, then refine what needs adjustment.
Use Reference Examples
Referencing existing tracks or artists helps Wubble extract multiple parameters simultaneously from the reference's characteristics.
Review Extracted Parameters
Check what Wubble understood from your request. If parameters aren't right, correct them—this improves future interpretation accuracy.
Be Specific About Changes
When adjusting, specify magnitude: "increase bass by 3dB" is clearer than "more bass." Both work, but precision reduces iteration cycles.
Embrace Iterative Refinement
Start with approximate parameters and refine through conversation. Perfect initial specification isn't necessary—adjustment is natural and expected.