How to Use AI in Audacity: Step-by-Step Guide to Stem Separation, Noise Suppression, Transcription & Audio Enhancement with OpenVINO

How to Use AI in Audacity: Step-by-Step Guide to Stem Separation, Noise Suppression, Transcription & Audio Enhancement with OpenVINO

Audacity includes integrated AI tools for tasks such as stem separation, noise suppression, and speech transcription, all of which run locally through OpenVINO plugins. These features are particularly useful for creators working with podcasts, music tracks, or interview recordings, enabling them to isolate vocals, clean up background interference, or generate transcripts without needing third-party software or an internet connection.

Performance may vary depending on your system, but for context, I’m using an ASUS Zenbook S14 OLED with 32GB RAM, a 512GB SSD, and an Intel Core Ultra 7 258V processor as my daily driver. In my experience, the performance has been absolutely fine, even when running heavier models like Whisper for transcription or 4-stem separation tasks.

Also read: ASUS ExpertBook P1 First Impressions: A No-Nonsense Business Workhorse with Surprising Power

Step 1: Install Audacity and OpenVINO AI Plugins

Audacity download page for version 3.7.3, listing OS options (Windows, macOS, Linux) and extra resources.
  1. Download and Install Audacity:
Audacity page showing how to install AI effects via OpenVINO plugins, with a download button and 7-step guide.
  1. Download OpenVINO AI Plugins:
    • Access the OpenVINO AI Plugins for Audacity page.
    • Download the installer suitable for your system.
    • Run the installer, select the desired AI models (e.g., noise suppression, music separation, transcription), and complete the installation process.
  2. Enable OpenVINO in Audacity:
    • Open Audacity.
    • Navigate to Edit > Preferences > Modules.
    • Locate mod-openvino and set it to Enabled.
    • Restart Audacity to apply the changes.

Step 2: Perform AI-Powered Stem Separation

Audacity interface showing the “Effect” menu open with the “OpenVINO AI Effects” submenu visible, listing options for Music Separation, Noise Suppression, and Super Resolution.

Stem separation allows you to isolate individual components (e.g., vocals, drums, bass) from a mixed audio track. This makes it easier to remix songs, remove unwanted elements, or enhance specific parts during editing.

  1. Import Your Audio File:
    • Go to File > Open and select the audio file you wish to process.
  2. Select the Audio Track:
    • Click on the track to ensure it’s selected.
  3. Apply Music Separation:
    • Navigate to Effect > OpenVINO AI Effects > OpenVINO Music Separation.
  4. Configure Separation Settings:
    • Choose between:
      • 2-Stem: Separates into vocals and instrumental.
      • 4-Stem: Separates into vocals, drums, bass, and other instruments.
    • Select the processing device (CPU, GPU, or NPU) based on your system’s capabilities.
  5. Execute the Separation:
    • Click Apply and wait for the process to complete. Processing time may vary depending on the length of the track and your hardware specifications.

Step 3: Utilize AI-Based Noise Suppression

Audacity interface with the “Tools” menu open, highlighting the “OpenVINO Whisper Transcription” option among analysis tools like Contrast, Plot Spectrum, and Beat Finder.

Improve the clarity of your audio recordings by minimising background noise and distractions. This helps your voice stand out more clearly, making the final output sound cleaner and more professional.

  1. Select the Audio Segment:
    • Highlight the portion of the track you want to clean.
  2. Apply Noise Suppression:
    • Go to Effect > OpenVINO AI Effects > OpenVINO Noise Suppression.
  3. Adjust Parameters:
    • Modify settings such as suppression level and sensitivity to achieve the desired noise reduction.
  4. Preview and Apply:
    • Use the Preview button to listen to the changes.
    • Once satisfied, click Apply to process the audio.

Also read: Microsoft needs to listen to HP to improve handheld gaming

Step 4: Transcribe Audio Using Whisper Integration

Audacity interface with the “Tools” menu open, highlighting the “OpenVINO Whisper Transcription” option among analysis tools like Contrast, Plot Spectrum, and Beat Finder.

Transcribe spoken words from your audio tracks into text using the Whisper model. It’s a handy tool for generating captions, documenting interviews, or making your content more accessible.

  1. Highlight the Audio for Transcription:
    • Select the segment of the track containing speech.
  2. Initiate Transcription:
    • Navigate to Analyze > OpenVINO Whisper Transcription.
  3. Configure Transcription Settings:
    • Choose the appropriate Whisper model (e.g., base, medium, large) based on your accuracy and performance needs.
    • Set the language or opt for auto-detection.
  4. Execute and Review:
    • Click Apply to start the transcription process.
    • The transcribed text will appear as a label track aligned with the audio.
  5. Export Transcription:
    • Go to File > Export > Export Labels to save the transcription as a text file.

Step 5: Enhance Audio Quality with OpenVINO Super Resolution

Audacity window with the “Effect” tab open, showing the “OpenVINO AI Effects” submenu expanded with three options: Music Separation, Noise Suppression, and Super Resolution. The option to repeat the last used AI effect is also visible.

OpenVINO Super Resolution upscales lower-quality audio by restoring lost detail and improving fidelity, particularly helpful when working with compressed or old recordings.

  1. Select Low-Quality Audio
    Choose the part of the track that sounds dull, muffled, or degraded.
  2. Navigate to Super Resolution Effect
    Effect > OpenVINO AI Effects > OpenVINO Super Resolution
  3. Choose Quality Level
    Options may include mild, standard, and aggressive enhancement modes depending on your install.
  4. Preview and Apply
    Listen to a preview before finalising. Once satisfied, click Apply.

This feature is especially effective for reviving archived audio, podcast edits, or refining user-generated content recorded in subpar conditions.

Tips for Optimal Performance

  • Hardware Acceleration: Utilize GPU or NPU acceleration if available to speed up processing times.
  • Model Selection: Larger models offer higher accuracy but require more computational resources. Choose based on your system’s capabilities and project requirements.
  • Batch Processing: For multiple files, consider using Audacity’s batch processing features to automate repetitive tasks.

Also read: How to Use Intel AI Playground Effectively and Run LLMs Locally (Even Offline)

Yetnesh Dubey

Yetnesh Dubey

Yetnesh works as a reviewer with Digit and likes to write about stuff related to hardware. He is also an auto nut and in an alternate reality works as a trucker delivering large boiling equipment across Europe. View Full Profile

Digit.in
Logo
Digit.in
Logo