FAQ & User Guide
Everything you need to know about using SoundMindAI, from recording your first meeting to setting up AI-powered transcription and summarization.
🎯 Getting Started
What is SoundMindAI?
SoundMindAI is a native macOS application that helps you capture, transcribe, and understand audio content. Whether you're recording meetings, lectures, interviews, or podcasts, SoundMindAI transforms spoken words into organized, searchable text with AI-powered insights.
Key capabilities:
- Record system audio and microphone simultaneously
- Import existing audio files
- Transcribe with Apple Speech (free) or premium AI services
- Generate summaries, key points, and action items using AI
- Organize recordings with tags, folders, and search
- Export in multiple formats
What are the system requirements?
- macOS: macOS 13 (Ventura) or later
- Processor: Apple Silicon or Intel Mac
- Storage: 100 MB for the app, plus space for recordings
- Permissions: Screen Recording (for system audio) and Microphone access
How do I grant the required permissions?
On first launch, SoundMindAI will request the necessary permissions. If you need to enable them manually:
Open System Settings
Click the Apple menu > System Settings > Privacy & Security
Enable Screen Recording
Select "Screen Recording" from the list and toggle on SoundMindAI
Enable Microphone Access
Select "Microphone" from the list and toggle on SoundMindAI
Restart the App
Quit and relaunch SoundMindAI for permissions to take effect
How does the 7-day free trial work?
The free trial gives you full access to all features for 7 days. No credit card required.
- Trial starts when you click "Start Free Trial"
- All features are unlocked including BYOK AI services
- Trial is tied to your Mac (hardware-based), not your email
- After 7 days, purchase a license or continue with limited features
After trial expires: You can still record and use Apple Speech transcription. BYOK AI features require a license.
🎤 Recording
How do I start a recording?
Starting a recording is simple:
Click the Record Button
Click the large red record button on the main screen, or use the keyboard shortcut
Select Audio Sources
Choose to record system audio, microphone, or both
Start Recording
The timer will start and you'll see a visual indicator that recording is active
Stop When Done
Click the stop button or use the keyboard shortcut to end the recording
Can I record system audio from specific applications?
SoundMindAI captures all system audio output. To record specific applications:
- Mute applications you don't want to record
- Use the application's own audio settings to control volume
- Consider using a virtual audio device for more control
Future versions may include per-application audio selection.
How do I import existing audio files?
You can import audio files for transcription:
- Drag and drop audio files onto the SoundMindAI window
- Use File > Import Audio or the keyboard shortcut
- Select files from the file browser
Supported formats: M4A, MP3, WAV, CAF, AIFF, MP4, MOV
Where are my recordings stored?
By default, recordings are stored in:
~/Library/Application Support/SoundMindAI/Recordings/
You can change the storage location in Settings > Storage. All recordings remain on your Mac - nothing is uploaded to the cloud.
📝 Transcription
What transcription options are available?
SoundMindAI offers multiple transcription options:
| Service | Cost | Speed | Accuracy |
|---|---|---|---|
| Apple Speech | Free (built-in) | Fast | Good |
| OpenAI Whisper | ~$0.006/min | Fast | Excellent |
| AssemblyAI | ~$0.0065/min | Fast | Excellent |
Apple Speech requires no setup and works offline. BYOK services offer higher accuracy but require API keys.
How do I transcribe a recording?
After recording or importing audio:
- Select the recording from your library
- Click "Transcribe" or use the keyboard shortcut
- Choose your transcription service (Apple Speech or BYOK)
- Wait for transcription to complete
Transcripts appear in the detail view with timestamps for easy navigation.
Can I edit transcripts?
Yes! You can edit transcripts to fix errors:
- Click on the transcript text to enter edit mode
- Make corrections directly in the text
- Changes are saved automatically
- Original timestamps are preserved
🤖 AI Summarization
What AI summaries does SoundMindAI generate?
SoundMindAI uses AI to extract meaningful insights from your transcripts:
- Summary: Concise overview of the content
- Key Points: Important highlights and takeaways
- Action Items: Tasks and follow-ups identified from the conversation
Each item includes timestamps so you can jump to the relevant part of the recording.
Which AI providers are supported?
SoundMindAI supports multiple AI providers for summarization:
- OpenAI - GPT-4o, GPT-4, GPT-3.5-turbo
- Anthropic - Claude 3.5 Sonnet, Claude 3 Opus
- Google - Gemini 1.5 Pro, Gemini 1.5 Flash
- HuggingFace - Various open-source models
- OpenRouter - Access multiple providers through one API
Each provider has different strengths - experiment to find what works best for your content.
How much does AI summarization cost?
Costs vary by provider and model. Typical costs for summarizing a 1-hour transcript:
- GPT-4o: ~$0.05-0.15
- GPT-3.5-turbo: ~$0.01-0.03
- Claude 3.5 Sonnet: ~$0.05-0.10
- Gemini 1.5 Flash: ~$0.01-0.02
Costs depend on transcript length. You pay directly to your chosen provider - SoundMindAI doesn't add any markup.
🔑 BYOK (Bring Your Own Keys) Explained
What does BYOK mean?
BYOK stands for "Bring Your Own Keys." Instead of us charging you for AI services, you get API keys directly from the service providers (like OpenAI or Anthropic) and enter them in SoundMindAI.
Here's how it works:
- You create an account with an AI provider (e.g., OpenAI)
- You add payment and get an API key from them
- You enter that API key in SoundMindAI's settings
- SoundMindAI uses your key to access the AI service
- You pay the provider directly based on your usage
Why does SoundMindAI use BYOK instead of including AI?
BYOK offers significant advantages for you:
- Lower cost: Pay wholesale rates directly to providers instead of marked-up prices
- Choice: Pick the AI provider and model that works best for you
- Privacy: Your data goes directly to the provider - we never see it
- Control: Set your own usage limits and budgets
- No subscription: Pay only for what you use, when you use it
Are my API keys secure?
Yes, your API keys are stored securely:
- Keys are stored in macOS Keychain, the same place your passwords are stored
- Keys are encrypted using macOS system-level encryption
- Keys never leave your Mac (except when making API calls to providers)
- We cannot access, view, or retrieve your keys
🛠 API Setup Guides
How to set up OpenAI (Whisper + GPT)
OpenAI provides both Whisper (transcription) and GPT (summarization):
Create an OpenAI Account
Go to platform.openai.com/signup and create an account
Add Payment Method
Go to Settings > Billing and add a credit card. New accounts may receive free credits.
Generate API Key
Go to API Keys section, click "Create new secret key", and copy it immediately (you won't see it again)
Enter Key in SoundMindAI
Open SoundMindAI Settings > API Keys > OpenAI and paste your key
How to set up Anthropic (Claude)
Anthropic provides Claude for AI summarization:
Create an Anthropic Account
Go to console.anthropic.com and sign up
Add Payment Method
Navigate to Billing and add a payment method
Generate API Key
Go to API Keys, create a new key, and copy it
Enter Key in SoundMindAI
Open SoundMindAI Settings > API Keys > Anthropic and paste your key
How to set up Google (Gemini)
Google provides Gemini for AI summarization:
Go to Google AI Studio
Visit aistudio.google.com and sign in with your Google account
Get API Key
Click "Get API Key" in the left sidebar, then "Create API key"
Copy Your Key
Copy the generated API key
Enter Key in SoundMindAI
Open SoundMindAI Settings > API Keys > Google and paste your key
How to set up AssemblyAI (Transcription)
AssemblyAI provides high-quality transcription:
Create AssemblyAI Account
Go to assemblyai.com and sign up
Get Your API Key
Your API key is shown on your dashboard immediately after signing up
Add Credits
Go to Billing to add credits or set up a payment method
Enter Key in SoundMindAI
Open SoundMindAI Settings > API Keys > AssemblyAI and paste your key
How to set up OpenRouter (Multiple Providers)
OpenRouter lets you access multiple AI providers with one API key:
Create OpenRouter Account
Go to openrouter.ai and sign up
Add Credits
Add credits to your account in the Billing section
Get API Key
Go to Keys section and create a new API key
Enter Key in SoundMindAI
Open SoundMindAI Settings > API Keys > OpenRouter and paste your key
📁 Organization & Management
How do I organize my recordings?
SoundMindAI provides several ways to organize your recordings:
- Tags: Add custom tags with colors to categorize recordings
- Folders: Create folders to group related recordings
- Search: Search by title, transcript content, or tags
- Sort: Sort by date, duration, or name
- Filter: Filter by tags, date range, or transcription status
How do I add tags to recordings?
To add tags:
- Select a recording from your library
- Click the Tags section in the detail view
- Type a tag name and press Enter, or select from existing tags
- Click the color dot to change tag color
You can also batch-tag multiple recordings by selecting them first.
Can I export my recordings and transcripts?
Yes! Export options include:
- Audio: Export original audio file (M4A)
- Transcript: Export as plain text, markdown, or SRT subtitles
- Summary: Export summary, key points, and action items
- Combined: Export everything in one package
Use File > Export or right-click on a recording to access export options.
How do I delete recordings?
To delete recordings:
- Select the recording(s) you want to delete
- Press Delete or right-click and select "Delete"
- Confirm deletion in the dialog
▶️ Playback
How do I play back recordings?
Select a recording and use the built-in player:
- Play/Pause: Space bar or play button
- Seek: Click anywhere on the timeline
- Skip: Arrow keys for 5-second jumps
- Speed: Adjust playback speed (0.5x to 2x)
- Volume: Use the volume slider
Can I jump to specific parts of a recording?
Yes! Click on any timestamped item to jump to that position:
- Click transcript segments to jump to that part
- Click key points to hear the relevant section
- Click action items to hear the context
Timestamps are clickable throughout the interface.
🔧 Troubleshooting
Recording isn't capturing system audio
If system audio isn't being captured:
- Check Screen Recording permission: System Settings > Privacy & Security > Screen Recording. Ensure SoundMindAI is enabled.
- Restart the app: Quit and relaunch SoundMindAI after granting permission.
- Check audio source: Ensure "System Audio" is selected in recording settings.
- Test with another app: Play audio from any app while recording to verify.
Transcription is failing or stuck
If transcription isn't working:
- Apple Speech: Ensure your Mac has an internet connection (required for some languages)
- BYOK services: Verify your API key is correct in Settings
- Check credits: Ensure you have available credits/balance with your AI provider
- File format: Ensure the audio file is in a supported format
- File size: Very long recordings may need to be split
API key errors
If you're seeing API key errors:
- Invalid key: Double-check you copied the entire key with no extra spaces
- Expired key: Some providers expire unused keys. Generate a new one.
- No credits: Check your provider account for available balance
- Rate limited: Wait a moment and try again
You can test your API keys in Settings by clicking the "Test" button next to each key.
How do I contact support?
For support, please email us at:
Please include:
- Description of the issue
- macOS version
- SoundMindAI version (from About menu)
- Steps to reproduce the problem
- Any error messages you see