Introduction to Audio Signal Processing
Audio signal processing is the intentional manipulation of audio signals to modify their characteristics or extract information. It encompasses a wide range of techniques used in recording, production, distribution, and analysis of audio content. This cheatsheet provides a comprehensive reference for key concepts, techniques, and applications.
Fundamental Concepts
Audio Signal Basics
Concept | Description | Key Parameters |
---|---|---|
Analog Signal | Continuous electrical voltage/current representing sound | Amplitude, frequency, phase |
Digital Signal | Discrete numerical representation of audio | Sample rate, bit depth, format |
Sampling | Process of converting continuous signal to discrete values | Sample rate (Hz), anti-aliasing |
Quantization | Assigning discrete amplitude values to samples | Bit depth, dithering |
Nyquist Theorem | To capture a frequency, sample rate must be at least 2x that frequency | Sample rate ≥ 2 × highest frequency |
Bit Depth | Number of bits used to represent each sample | 16-bit (CD), 24-bit (pro audio), 32-bit float |
Sample Rate | Number of samples captured per second | 44.1kHz (CD), 48kHz (video), 96kHz (high-res) |
Digital Audio Fundamentals
Concept | Description | Applications |
---|---|---|
Time Domain | Amplitude changes over time | Waveform editing, dynamics processing |
Frequency Domain | Energy distribution across frequency spectrum | Spectral analysis, EQ, filtering |
Fast Fourier Transform (FFT) | Algorithm to convert time domain to frequency domain | Spectrum analyzers, frequency-specific processing |
Phase | Relative timing of waveforms (measured in degrees) | Phase alignment, spatial effects |
Latency | Time delay introduced during processing | Real-time monitoring, performance issues |
Aliasing | Artifacts when sampling rate is insufficient | Anti-aliasing filters, oversampling |
Dithering | Adding noise to reduce quantization errors | Bit-depth reduction, mastering |
Core Signal Processing Techniques
Filters
Filter Type | Response | Applications | Parameters |
---|---|---|---|
Low-Pass | Allows frequencies below cutoff point | Remove high-frequency noise, warmth | Cutoff frequency, slope (dB/octave) |
High-Pass | Allows frequencies above cutoff point | Remove rumble, clean up low end | Cutoff frequency, slope (dB/octave) |
Band-Pass | Allows a specific frequency band | Isolate specific elements, effects | Center frequency, Q (bandwidth), gain |
Band-Reject/Notch | Rejects a specific frequency band | Remove interference, hum | Center frequency, Q (bandwidth) |
Shelf | Boosts/cuts frequencies above/below threshold | Broad tonal shaping | Frequency, gain, Q |
Parametric | Precise control of frequency band | Surgical EQ, problem solving | Center frequency, Q (bandwidth), gain |
Graphic | Fixed frequency bands | Quick adjustments, live sound | Number of bands (typically 7-31) |
Filter Slopes/Orders
Filter Order | Slope | Characteristics |
---|---|---|
1st Order | 6 dB/octave | Gentle, minimal phase shift |
2nd Order | 12 dB/octave | Standard filter in many EQs |
3rd Order | 18 dB/octave | Steeper, more noticeable phase shift |
4th Order | 24 dB/octave | Common in crossovers, significant phase shift |
Higher Orders | 30+ dB/octave | Very steep cutoff, substantial phase impact |
Dynamics Processing
Processor | Function | Parameters | Applications |
---|---|---|---|
Compressor | Reduces dynamic range by attenuating loud signals | Threshold, ratio, attack, release, knee, makeup gain | Control levels, increase sustain, add character |
Limiter | Prevents signal from exceeding threshold | Threshold, release, lookahead | Prevent clipping, maximize loudness |
Expander | Increases dynamic range by reducing quiet signals | Threshold, ratio, attack, release | Increase dynamic range, noise reduction |
Gate | Silences signal below threshold | Threshold, attack, release, hold, range | Remove background noise, creative effects |
De-esser | Frequency-specific compression targeting sibilance | Frequency range, threshold, amount | Reduce vocal sibilance, harshness |
Multiband Compressor | Applies different compression to frequency bands | Crossover frequencies, per-band settings | Frequency-specific dynamic control |
Transient Designer | Shapes attack/sustain independently | Attack, sustain | Enhance drums, reduce pick noise |
Time-Based Effects
Effect | Description | Parameters | Applications |
---|---|---|---|
Delay | Creates echo repeats of the signal | Delay time, feedback, mix | Spatial effects, rhythmic enhancement |
Reverb | Simulates acoustic space reflections | Pre-delay, decay time, room size, damping | Create sense of space, depth, ambience |
Chorus | Creates detuned copies of signal | Rate, depth, mix | Thickening, width, movement |
Flanger | Short delay with modulation and feedback | Delay time, rate, depth, feedback | Swooshing sounds, special effects |
Phaser | All-pass filters with phase shifting | Stages, rate, depth, feedback | Sweeping effects, movement |
Tremolo | Amplitude modulation | Rate, depth, waveform | Volume variation, rhythmic effects |
Vibrato | Pitch modulation | Rate, depth, waveform | Pitch variation, vintage effects |
Pitch and Time Manipulation
Process | Description | Parameters | Applications |
---|---|---|---|
Pitch Shifting | Alters pitch without affecting time | Semitones/cents, formant preservation | Vocal tuning, harmonizing, creative effects |
Time Stretching | Alters duration without affecting pitch | Stretch factor, algorithm type | Tempo matching, creative effects |
Formant Shifting | Modifies resonant characteristics | Formant scale | Gender changing, creative vocal effects |
Auto-Tune | Corrects pitch to nearest note | Key, scale, speed, amount | Pitch correction, creative effects |
Vocoder | Impresses characteristics of one signal onto another | Bands, carrier signal | Robot voice, creative synthesis |
Granular Processing | Breaks audio into small grains for manipulation | Grain size, density, pitch, position | Textural effects, radical transformations |
Advanced Processing Techniques
Spectral Processing
Technique | Description | Applications |
---|---|---|
Spectral Analysis | Visualizing frequency content | Identifying problems, mixing guidance |
Noise Reduction | Targeted reduction of unwanted noise | Restoration, cleaning up recordings |
Spectral Editing | Direct editing of specific frequency regions | Surgical removal of problems, restoration |
De-clicking/De-crackling | Removal of transient artifacts | Vinyl restoration, recording cleanup |
De-reverb | Reduction of reverb from recordings | Dry up too-wet recordings, dialogue clarity |
Harmonic Enhancement | Addition of harmonically related content | Adding warmth, presence, character |
Frequency-Dependent Processing | Processing that varies with frequency content | Dynamic EQ, multiband compression |
Convolution and Modeling
Technique | Description | Applications |
---|---|---|
Convolution | Process that combines audio with impulse response | Realistic reverb, amp simulation |
Impulse Response | Acoustic “fingerprint” of space or device | Capturing real spaces, gear modeling |
Algorithmic Modeling | Mathematical simulation of acoustic behaviors | Creating effects, instrument modeling |
Physical Modeling | Simulation based on physical properties | Instrument synthesis, acoustic simulation |
Spatial Processing
Technique | Description | Parameters | Applications |
---|---|---|---|
Stereo Imaging | Manipulation of stereo field | Width, panning, correlation | Create wider/narrower stereo image |
Binaural Processing | Creates 3D audio for headphones | HRTF model, source position | Immersive audio, virtual reality |
Ambisonics | Full-sphere surround sound format | Order, decoding method | 360° audio, VR/AR applications |
Mid-Side Processing | Separate processing of center vs. sides | Mid/side balance, processing | Stereo control, mastering |
Digital Signal Processing (DSP) Concepts
Key DSP Terminology
Term | Description | Importance |
---|---|---|
Algorithm | Set of instructions for processing signal | Defines how effects/processors work |
Buffer Size | Number of samples processed at once | Affects latency and CPU usage |
Vector Size | Similar to buffer size | Processing efficiency metric |
Sample Accurate | Processing aligned precisely to samples | Timing precision |
Zero Latency | Processing without delay | Real-time monitoring |
Oversampling | Processing at higher sample rate | Reduces aliasing, improves quality |
Interpolation | Creating new samples between existing ones | Smooth transitions, upsampling |
Decimation | Reducing number of samples | Downsampling, efficiency |
Processing Architecture
Architecture | Description | Applications |
---|---|---|
Serial Processing | Effects in sequential chain | Standard DAW signal flow |
Parallel Processing | Multiple processes applied separately then mixed | Advanced mixing techniques |
Feedback Loop | Output fed back into input | Special effects, analog modeling |
Sidechain | Using one signal to control processing of another | Ducking, de-essing, creative effects |
Adaptive Processing | Automatically adjusts based on input | Smart EQ, intelligent dynamics |
Look-ahead | Analyzes future samples before processing | Zero-overshoot limiting, transient shaping |
Practical Applications
Recording Processing
Stage | Processes | Purpose |
---|---|---|
Input Conditioning | High-pass filtering, light compression | Clean signal capture |
Noise Management | Gates, expanders, noise reduction | Minimize noise |
Monitoring Effects | Light reverb, low-latency processing | Comfort for performers |
Pre-emphasis | Strategic EQ, dynamics | Capture optimal signal |
Mixing Techniques
Purpose | Processes | Approach |
---|---|---|
Clarity | EQ, compression, stereo positioning | Create space for each element |
Depth | Reverb, delay, level balancing | Create front-to-back perspective |
Width | Stereo imaging, panning, delay | Create stereo field |
Punch | Transient shaping, compression, parallel processing | Enhance impact |
Glue | Bus compression, saturation, reverb | Create cohesion |
Mastering Processing
Purpose | Processes | Considerations |
---|---|---|
Tonal Balance | EQ, spectral processing | Overall frequency balance |
Dynamic Control | Compression, limiting | Consistent loudness, peaks |
Stereo Enhancement | Mid-side processing, stereo imaging | Width vs. mono compatibility |
Loudness Optimization | Limiting, clipping, saturation | Target LUFS, true peak limits |
Format Preparation | Dithering, sample rate conversion | Delivery specifications |
Restoration Techniques
Issue | Processes | Approach |
---|---|---|
Background Noise | Noise reduction, spectral editing | Fingerprint noise, careful reduction |
Clicks and Pops | De-clicking, manual editing | Threshold detection, repair |
Clipping | De-clipping, waveform reconstruction | Reconstruct peaks, reduce distortion |
Reverb Reduction | De-reverb, spectral processing | Extract direct sound |
Sibilance | De-essing, multiband compression | Target 5-8kHz range, careful threshold |
Audio Measurement and Analysis
Level Measurements
Measurement | Description | Applications |
---|---|---|
Peak Level | Maximum instantaneous amplitude | Preventing clipping, headroom |
RMS Level | Root Mean Square – approximates perceived loudness | Mix balancing, loudness assessment |
LUFS | Loudness Units Full Scale – standardized loudness | Broadcast standards, streaming targets |
True Peak | Actual peak including inter-sample peaks | Preventing digital clipping |
Crest Factor | Ratio between peak and RMS | Dynamic range assessment |
Dynamic Range | Difference between loudest and quietest parts | Music dynamics, broadcast compliance |
Analysis Tools
Tool | Function | Applications |
---|---|---|
Spectrum Analyzer | Shows frequency content | Identifying frequency issues |
Phase Correlation Meter | Shows stereo phase relationship | Checking mono compatibility |
Loudness Meter | Measures perceived loudness | Broadcast compliance, streaming targets |
Oscilloscope | Shows waveform in time domain | Detailed signal analysis |
Spectrogram | Shows frequency content over time | Identifying time-based issues |
Goniometer | Shows stereo field representation | Stereo image assessment |
Digital Audio Workstation (DAW) Concepts
Signal Flow
Concept | Description | Importance |
---|---|---|
Insert Effects | Processing applied directly to track | Direct signal processing |
Send/Return (Aux) | Parallel processing via buses | Shared effects, parallel processing |
Pre-fader vs. Post-fader | Whether processing comes before/after level fader | Effect level independence |
Groups/Buses | Combining multiple tracks for shared processing | Organizational and processing efficiency |
VCA Faders | Control levels without affecting processing | Advanced level management |
Routing Matrix | Complex signal path management | Custom signal flows |
Automation
Type | Description | Applications |
---|---|---|
Volume Automation | Dynamic control of levels | Detailed mix balance |
Parameter Automation | Dynamic control of effect parameters | Creative effects, problem solving |
Automation Modes | Read, write, touch, latch | Different editing approaches |
Automation Curves | Linear vs. logarithmic | Natural-sounding transitions |
Format Specifications and Standards
Audio File Formats
Format | Characteristics | Best For |
---|---|---|
WAV/AIFF | Uncompressed, lossless | Recording, mixing, mastering |
FLAC | Compressed, lossless | Storage, high-quality distribution |
MP3 | Compressed, lossy | Distribution, streaming (lower quality) |
AAC | Compressed, lossy | Distribution, streaming (higher quality) |
DSD | 1-bit, high sample rate | Audiophile recordings |
Broadcast Standards
Standard | Specifications | Used For |
---|---|---|
EBU R128 | -23 LUFS, max true peak -1dBTP | European broadcast |
ATSC A/85 | -24 LUFS, max true peak -2dBTP | US broadcast |
Streaming | -14 to -16 LUFS range | Online platforms |
Cinema | -27 LUFS dialog, wide dynamic range | Theatrical release |
Resources for Further Learning
Technical Reading
- “Audio Engineering 101” by Tim Dittmar
- “Mixing Secrets for the Small Studio” by Mike Senior
- “The Science of Sound” by Thomas Rossing
- “Digital Signal Processing: A Practical Guide” by Steven W. Smith
Online Resources
- iZotope Learn Portal
- Sound on Sound Magazine
- ProSoundWeb
- Audio Engineering Society (AES) papers
Software and Tools
- Audio analysis: Voxengo SPAN, iZotope Insight
- Audio editors: Adobe Audition, iZotope RX
- Plugins: FabFilter, iZotope, Waves, UAD
- Testing tools: Room EQ Wizard, ARTA
Remember that while this cheatsheet provides technical information, developing a good ear through critical listening practice is equally important to successful audio signal processing.