Introduction to Audio Signal Processing
Audio signal processing is the intentional manipulation of audio signals to modify their characteristics or extract information. It encompasses a wide range of techniques used in recording, production, distribution, and analysis of audio content. This cheatsheet provides a comprehensive reference for key concepts, techniques, and applications.
Fundamental Concepts
Audio Signal Basics
| Concept | Description | Key Parameters |
|---|---|---|
| Analog Signal | Continuous electrical voltage/current representing sound | Amplitude, frequency, phase |
| Digital Signal | Discrete numerical representation of audio | Sample rate, bit depth, format |
| Sampling | Process of converting continuous signal to discrete values | Sample rate (Hz), anti-aliasing |
| Quantization | Assigning discrete amplitude values to samples | Bit depth, dithering |
| Nyquist Theorem | To capture a frequency, sample rate must be at least 2x that frequency | Sample rate ≥ 2 × highest frequency |
| Bit Depth | Number of bits used to represent each sample | 16-bit (CD), 24-bit (pro audio), 32-bit float |
| Sample Rate | Number of samples captured per second | 44.1kHz (CD), 48kHz (video), 96kHz (high-res) |
Digital Audio Fundamentals
| Concept | Description | Applications |
|---|---|---|
| Time Domain | Amplitude changes over time | Waveform editing, dynamics processing |
| Frequency Domain | Energy distribution across frequency spectrum | Spectral analysis, EQ, filtering |
| Fast Fourier Transform (FFT) | Algorithm to convert time domain to frequency domain | Spectrum analyzers, frequency-specific processing |
| Phase | Relative timing of waveforms (measured in degrees) | Phase alignment, spatial effects |
| Latency | Time delay introduced during processing | Real-time monitoring, performance issues |
| Aliasing | Artifacts when sampling rate is insufficient | Anti-aliasing filters, oversampling |
| Dithering | Adding noise to reduce quantization errors | Bit-depth reduction, mastering |
Core Signal Processing Techniques
Filters
| Filter Type | Response | Applications | Parameters |
|---|---|---|---|
| Low-Pass | Allows frequencies below cutoff point | Remove high-frequency noise, warmth | Cutoff frequency, slope (dB/octave) |
| High-Pass | Allows frequencies above cutoff point | Remove rumble, clean up low end | Cutoff frequency, slope (dB/octave) |
| Band-Pass | Allows a specific frequency band | Isolate specific elements, effects | Center frequency, Q (bandwidth), gain |
| Band-Reject/Notch | Rejects a specific frequency band | Remove interference, hum | Center frequency, Q (bandwidth) |
| Shelf | Boosts/cuts frequencies above/below threshold | Broad tonal shaping | Frequency, gain, Q |
| Parametric | Precise control of frequency band | Surgical EQ, problem solving | Center frequency, Q (bandwidth), gain |
| Graphic | Fixed frequency bands | Quick adjustments, live sound | Number of bands (typically 7-31) |
Filter Slopes/Orders
| Filter Order | Slope | Characteristics |
|---|---|---|
| 1st Order | 6 dB/octave | Gentle, minimal phase shift |
| 2nd Order | 12 dB/octave | Standard filter in many EQs |
| 3rd Order | 18 dB/octave | Steeper, more noticeable phase shift |
| 4th Order | 24 dB/octave | Common in crossovers, significant phase shift |
| Higher Orders | 30+ dB/octave | Very steep cutoff, substantial phase impact |
Dynamics Processing
| Processor | Function | Parameters | Applications |
|---|---|---|---|
| Compressor | Reduces dynamic range by attenuating loud signals | Threshold, ratio, attack, release, knee, makeup gain | Control levels, increase sustain, add character |
| Limiter | Prevents signal from exceeding threshold | Threshold, release, lookahead | Prevent clipping, maximize loudness |
| Expander | Increases dynamic range by reducing quiet signals | Threshold, ratio, attack, release | Increase dynamic range, noise reduction |
| Gate | Silences signal below threshold | Threshold, attack, release, hold, range | Remove background noise, creative effects |
| De-esser | Frequency-specific compression targeting sibilance | Frequency range, threshold, amount | Reduce vocal sibilance, harshness |
| Multiband Compressor | Applies different compression to frequency bands | Crossover frequencies, per-band settings | Frequency-specific dynamic control |
| Transient Designer | Shapes attack/sustain independently | Attack, sustain | Enhance drums, reduce pick noise |
Time-Based Effects
| Effect | Description | Parameters | Applications |
|---|---|---|---|
| Delay | Creates echo repeats of the signal | Delay time, feedback, mix | Spatial effects, rhythmic enhancement |
| Reverb | Simulates acoustic space reflections | Pre-delay, decay time, room size, damping | Create sense of space, depth, ambience |
| Chorus | Creates detuned copies of signal | Rate, depth, mix | Thickening, width, movement |
| Flanger | Short delay with modulation and feedback | Delay time, rate, depth, feedback | Swooshing sounds, special effects |
| Phaser | All-pass filters with phase shifting | Stages, rate, depth, feedback | Sweeping effects, movement |
| Tremolo | Amplitude modulation | Rate, depth, waveform | Volume variation, rhythmic effects |
| Vibrato | Pitch modulation | Rate, depth, waveform | Pitch variation, vintage effects |
Pitch and Time Manipulation
| Process | Description | Parameters | Applications |
|---|---|---|---|
| Pitch Shifting | Alters pitch without affecting time | Semitones/cents, formant preservation | Vocal tuning, harmonizing, creative effects |
| Time Stretching | Alters duration without affecting pitch | Stretch factor, algorithm type | Tempo matching, creative effects |
| Formant Shifting | Modifies resonant characteristics | Formant scale | Gender changing, creative vocal effects |
| Auto-Tune | Corrects pitch to nearest note | Key, scale, speed, amount | Pitch correction, creative effects |
| Vocoder | Impresses characteristics of one signal onto another | Bands, carrier signal | Robot voice, creative synthesis |
| Granular Processing | Breaks audio into small grains for manipulation | Grain size, density, pitch, position | Textural effects, radical transformations |
Advanced Processing Techniques
Spectral Processing
| Technique | Description | Applications |
|---|---|---|
| Spectral Analysis | Visualizing frequency content | Identifying problems, mixing guidance |
| Noise Reduction | Targeted reduction of unwanted noise | Restoration, cleaning up recordings |
| Spectral Editing | Direct editing of specific frequency regions | Surgical removal of problems, restoration |
| De-clicking/De-crackling | Removal of transient artifacts | Vinyl restoration, recording cleanup |
| De-reverb | Reduction of reverb from recordings | Dry up too-wet recordings, dialogue clarity |
| Harmonic Enhancement | Addition of harmonically related content | Adding warmth, presence, character |
| Frequency-Dependent Processing | Processing that varies with frequency content | Dynamic EQ, multiband compression |
Convolution and Modeling
| Technique | Description | Applications |
|---|---|---|
| Convolution | Process that combines audio with impulse response | Realistic reverb, amp simulation |
| Impulse Response | Acoustic “fingerprint” of space or device | Capturing real spaces, gear modeling |
| Algorithmic Modeling | Mathematical simulation of acoustic behaviors | Creating effects, instrument modeling |
| Physical Modeling | Simulation based on physical properties | Instrument synthesis, acoustic simulation |
Spatial Processing
| Technique | Description | Parameters | Applications |
|---|---|---|---|
| Stereo Imaging | Manipulation of stereo field | Width, panning, correlation | Create wider/narrower stereo image |
| Binaural Processing | Creates 3D audio for headphones | HRTF model, source position | Immersive audio, virtual reality |
| Ambisonics | Full-sphere surround sound format | Order, decoding method | 360° audio, VR/AR applications |
| Mid-Side Processing | Separate processing of center vs. sides | Mid/side balance, processing | Stereo control, mastering |
Digital Signal Processing (DSP) Concepts
Key DSP Terminology
| Term | Description | Importance |
|---|---|---|
| Algorithm | Set of instructions for processing signal | Defines how effects/processors work |
| Buffer Size | Number of samples processed at once | Affects latency and CPU usage |
| Vector Size | Similar to buffer size | Processing efficiency metric |
| Sample Accurate | Processing aligned precisely to samples | Timing precision |
| Zero Latency | Processing without delay | Real-time monitoring |
| Oversampling | Processing at higher sample rate | Reduces aliasing, improves quality |
| Interpolation | Creating new samples between existing ones | Smooth transitions, upsampling |
| Decimation | Reducing number of samples | Downsampling, efficiency |
Processing Architecture
| Architecture | Description | Applications |
|---|---|---|
| Serial Processing | Effects in sequential chain | Standard DAW signal flow |
| Parallel Processing | Multiple processes applied separately then mixed | Advanced mixing techniques |
| Feedback Loop | Output fed back into input | Special effects, analog modeling |
| Sidechain | Using one signal to control processing of another | Ducking, de-essing, creative effects |
| Adaptive Processing | Automatically adjusts based on input | Smart EQ, intelligent dynamics |
| Look-ahead | Analyzes future samples before processing | Zero-overshoot limiting, transient shaping |
Practical Applications
Recording Processing
| Stage | Processes | Purpose |
|---|---|---|
| Input Conditioning | High-pass filtering, light compression | Clean signal capture |
| Noise Management | Gates, expanders, noise reduction | Minimize noise |
| Monitoring Effects | Light reverb, low-latency processing | Comfort for performers |
| Pre-emphasis | Strategic EQ, dynamics | Capture optimal signal |
Mixing Techniques
| Purpose | Processes | Approach |
|---|---|---|
| Clarity | EQ, compression, stereo positioning | Create space for each element |
| Depth | Reverb, delay, level balancing | Create front-to-back perspective |
| Width | Stereo imaging, panning, delay | Create stereo field |
| Punch | Transient shaping, compression, parallel processing | Enhance impact |
| Glue | Bus compression, saturation, reverb | Create cohesion |
Mastering Processing
| Purpose | Processes | Considerations |
|---|---|---|
| Tonal Balance | EQ, spectral processing | Overall frequency balance |
| Dynamic Control | Compression, limiting | Consistent loudness, peaks |
| Stereo Enhancement | Mid-side processing, stereo imaging | Width vs. mono compatibility |
| Loudness Optimization | Limiting, clipping, saturation | Target LUFS, true peak limits |
| Format Preparation | Dithering, sample rate conversion | Delivery specifications |
Restoration Techniques
| Issue | Processes | Approach |
|---|---|---|
| Background Noise | Noise reduction, spectral editing | Fingerprint noise, careful reduction |
| Clicks and Pops | De-clicking, manual editing | Threshold detection, repair |
| Clipping | De-clipping, waveform reconstruction | Reconstruct peaks, reduce distortion |
| Reverb Reduction | De-reverb, spectral processing | Extract direct sound |
| Sibilance | De-essing, multiband compression | Target 5-8kHz range, careful threshold |
Audio Measurement and Analysis
Level Measurements
| Measurement | Description | Applications |
|---|---|---|
| Peak Level | Maximum instantaneous amplitude | Preventing clipping, headroom |
| RMS Level | Root Mean Square – approximates perceived loudness | Mix balancing, loudness assessment |
| LUFS | Loudness Units Full Scale – standardized loudness | Broadcast standards, streaming targets |
| True Peak | Actual peak including inter-sample peaks | Preventing digital clipping |
| Crest Factor | Ratio between peak and RMS | Dynamic range assessment |
| Dynamic Range | Difference between loudest and quietest parts | Music dynamics, broadcast compliance |
Analysis Tools
| Tool | Function | Applications |
|---|---|---|
| Spectrum Analyzer | Shows frequency content | Identifying frequency issues |
| Phase Correlation Meter | Shows stereo phase relationship | Checking mono compatibility |
| Loudness Meter | Measures perceived loudness | Broadcast compliance, streaming targets |
| Oscilloscope | Shows waveform in time domain | Detailed signal analysis |
| Spectrogram | Shows frequency content over time | Identifying time-based issues |
| Goniometer | Shows stereo field representation | Stereo image assessment |
Digital Audio Workstation (DAW) Concepts
Signal Flow
| Concept | Description | Importance |
|---|---|---|
| Insert Effects | Processing applied directly to track | Direct signal processing |
| Send/Return (Aux) | Parallel processing via buses | Shared effects, parallel processing |
| Pre-fader vs. Post-fader | Whether processing comes before/after level fader | Effect level independence |
| Groups/Buses | Combining multiple tracks for shared processing | Organizational and processing efficiency |
| VCA Faders | Control levels without affecting processing | Advanced level management |
| Routing Matrix | Complex signal path management | Custom signal flows |
Automation
| Type | Description | Applications |
|---|---|---|
| Volume Automation | Dynamic control of levels | Detailed mix balance |
| Parameter Automation | Dynamic control of effect parameters | Creative effects, problem solving |
| Automation Modes | Read, write, touch, latch | Different editing approaches |
| Automation Curves | Linear vs. logarithmic | Natural-sounding transitions |
Format Specifications and Standards
Audio File Formats
| Format | Characteristics | Best For |
|---|---|---|
| WAV/AIFF | Uncompressed, lossless | Recording, mixing, mastering |
| FLAC | Compressed, lossless | Storage, high-quality distribution |
| MP3 | Compressed, lossy | Distribution, streaming (lower quality) |
| AAC | Compressed, lossy | Distribution, streaming (higher quality) |
| DSD | 1-bit, high sample rate | Audiophile recordings |
Broadcast Standards
| Standard | Specifications | Used For |
|---|---|---|
| EBU R128 | -23 LUFS, max true peak -1dBTP | European broadcast |
| ATSC A/85 | -24 LUFS, max true peak -2dBTP | US broadcast |
| Streaming | -14 to -16 LUFS range | Online platforms |
| Cinema | -27 LUFS dialog, wide dynamic range | Theatrical release |
Resources for Further Learning
Technical Reading
- “Audio Engineering 101” by Tim Dittmar
- “Mixing Secrets for the Small Studio” by Mike Senior
- “The Science of Sound” by Thomas Rossing
- “Digital Signal Processing: A Practical Guide” by Steven W. Smith
Online Resources
- iZotope Learn Portal
- Sound on Sound Magazine
- ProSoundWeb
- Audio Engineering Society (AES) papers
Software and Tools
- Audio analysis: Voxengo SPAN, iZotope Insight
- Audio editors: Adobe Audition, iZotope RX
- Plugins: FabFilter, iZotope, Waves, UAD
- Testing tools: Room EQ Wizard, ARTA
Remember that while this cheatsheet provides technical information, developing a good ear through critical listening practice is equally important to successful audio signal processing.
