Intro from Jay Allison: What's a good level? Not too hot, not too cold. Just right. Our TOOLS editor Jeff Towne spends a lot of time answering this perennial audio question. Transom is proud to present Jeff's Everything You Wanted to Know About Levels feature, and then some--with audio and visual examples, links to products and resources, the works. Print it out and study it, so you don't have to ask again.
Achieving proper levels when recording and mixing is one of the most fundamental tasks in audio production, yet it remains confusing to many producers. There’s a good reason for that: there is a dizzying array of standards, often in conflict with one another. Adding to the confusion, there’s no single answer for what the “correct” audio level is, but understanding the most common norms is very important. Independent producers and reporters are increasingly responsible for creating the final audio product, whether it’s a podcast, a short feature, or a complete radio program. In the last few years, engineering and broadcast organizations, as well as governmental agencies, have adopted new standards, mostly based on “Loudness,” so there are finally some tangible targets to shoot for when mixing a program for radio broadcast or for podcasting.
For many years, reporters and producers had the luxury of remaining fuzzy in their understanding of the finer points of audio levels, because there were always audio engineers at radio stations who understood this arcane topic and who would make adjustments before the audio reached the listeners’ ears. In fact there were often several engineers who would make adjustments to the audio signal as it made its way through the distribution and broadcast chain, and there were standard procedures for aligning the levels that attempted to assure that a relatively consistent sound went out over the airwaves.
But the world has changed. Producers now very often deliver the final audio directly to the listener, as happens in podcasts, or with minimal intervention by network or station engineers, as happens with many radio broadcasts. When a producer uploads a program to the Public Radio Satellite System’s ContentDepot, or PRX, those soundfiles are delivered directly to stations, where they are often simply placed in a station’s automation system, to air at the volume the producer chose.
We’ve all experienced the jarring effect of a commercial blaring out of our radio or television at a volume much higher than the surrounding program, and a similar phenomenon can result if producers do not mix their programs to a standard level.
Most radio stations have hardware designed to control the audio levels of the broadcast chain, which can even-out disparate sources to some degree, but that processing does not always compensate for widely varying sources, so it’s preferable that each program has similar levels to start with.
Peaks vs. Average Levels
The distinction between peak and average levels is crucial. It’s important to attend to peak levels in order to avoid distortion. If an audio signal reaches full-scale, and stays there for more than a passing moment, it will usually sound crunchy, blurry and distorted. On a waveform display, one can see when the tops and/or bottoms of the waves are flattened, rather than displaying the gradual up and down undulations of an undistorted waveform. That’s called “clipping” and it almost always sounds bad. You avoid it by watching peak-reading meters and making adjustments to prevent the levels from reaching “full scale,” also known as 0 dBfs.
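As a rough illustration of what those flattened tops look like in the data, here is a minimal Python sketch. It assumes floating-point samples where 1.0 represents full scale, and the helper names (sine, hard_clip, longest_full_scale_run) are invented for this example rather than taken from any audio program.

```python
import math

FULL_SCALE = 1.0  # assumption: float samples, where +/-1.0 corresponds to 0 dBfs


def sine(amplitude, cycles=4, samples_per_cycle=48):
    """Generate a few cycles of a sine wave at the given linear amplitude."""
    n = cycles * samples_per_cycle
    return [amplitude * math.sin(2 * math.pi * i / samples_per_cycle) for i in range(n)]


def hard_clip(samples):
    """Flatten anything beyond full scale, which is what clipping does."""
    return [max(-FULL_SCALE, min(FULL_SCALE, s)) for s in samples]


def longest_full_scale_run(samples):
    """Length of the longest run of consecutive samples stuck at full scale."""
    longest = run = 0
    for s in samples:
        run = run + 1 if abs(s) >= FULL_SCALE else 0
        longest = max(longest, run)
    return longest


clean = sine(0.5)               # peaks well below full scale
too_hot = hard_clip(sine(2.0))  # about 6 dB over full scale, then flattened

print(longest_full_scale_run(clean))    # 0: no flattened tops
print(longest_full_scale_run(too_hot))  # many samples in a row pinned at full scale
```

A single sample touching full scale is not necessarily audible; it is the long runs pinned at the maximum that correspond to the flat-topped waveforms described above.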
Recording in the field is largely concerned with peak levels: the basic task is to record at as high a level as possible, without clipping.
When mixing or mastering, the concern changes from merely avoiding clipping, to assuring consistent levels over time, and an overall loudness that meshes well with other productions. In order to achieve that, you need to pay attention to Average Levels, and more specifically to “Loudness.” Loudness is a measurement that takes into account not only the average levels, but also the balance of frequencies the sound contains, and other attributes of the sound that contribute to the real human perception of how loud the sound appears. Loudness can be observed by using a loudness meter. You can get close by using meters that display average, or RMS levels. RMS stands for Root Mean Square, and it’s a way of averaging signal levels over time, rather than favoring the momentary values of short transient sounds, as peak meters do. But Loudness measurement gives an even more accurate indication of how loud a sound will be perceived by real human ears. There’s much more about loudness, and how to work with loudness meters, right here on Transom.org.
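To make the difference between peak and RMS readings concrete, here is a small Python sketch. It assumes floating-point samples with 1.0 as full scale, and the function names are made up for illustration. It computes plain RMS only; a true Loudness meter also applies frequency weighting and gating, which this sketch does not attempt.

```python
import math


def to_db(value):
    """Convert a linear amplitude (full scale = 1.0) to dB relative to full scale."""
    return 20 * math.log10(value) if value > 0 else float("-inf")


def peak_db(samples):
    """Level of the single loudest sample."""
    return to_db(max(abs(s) for s in samples))


def rms_db(samples):
    """Root Mean Square: square each sample, average, then take the square root."""
    mean_square = sum(s * s for s in samples) / len(samples)
    return to_db(math.sqrt(mean_square))


# One second of a steady full-scale 1 kHz tone at a 48 kHz sample rate.
tone = [math.sin(2 * math.pi * 1000 * i / 48000) for i in range(48000)]

# One second of silence containing a single full-scale click.
click = [0.0] * 48000
click[100] = 1.0

print(round(peak_db(tone), 1), round(rms_db(tone), 1))    # peak near 0, RMS about 3 dB lower
print(round(peak_db(click), 1), round(rms_db(click), 1))  # same peak, far lower RMS
```

Both signals hit the same peak, yet the tone is very loud and the click is barely there, which is why peak meters alone say little about how loud a mix will sound.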
The difficult part of mixing a production so that it has the “right” average levels is that there are several competing standards. Governmental and professional groups have issued guidelines, some merely suggestions, others actual laws. There are perfectly valid arguments to be made for most of them. Some advocate for standards that allow greater dynamic range (the difference between the loudest and quietest possible sounds). Large dynamic range is especially appropriate for classical music, and for film sound, both of which often incorporate dramatic changes between soft and loud passages. But an overly-wide dynamic range can be problematic if the listener is left straining to hear the quiet parts, perhaps due to background noise, or an underpowered audio player (such as a portable mp3 player or a phone) or is startled by dramatic peaks.
On the opposite end of the spectrum, some advocate for a lower dynamic range, which is to say, a higher overall loudness on average. This is more appropriate for talk-radio, as well as many pop-music stations. This louder overall level means that there’s less dynamic range: peaks cannot be louder than the average level by as much, but all parts of the program are more easily heard. This can be especially helpful for listening in loud environments, where there’s background interference, like the road noise one hears when driving in a car. Higher average levels, and smaller dynamic range mean that low-level signals won’t get lost in background noise, but sometimes the audio material can sound squeezed and lifeless.
Dynamics processing, such as compression and limiting, is often required to tame the peaks enough to get average levels up to -12dBfs, but that kind of processing sometimes sounds unnatural. That’s been an ongoing problem in the music industry, sometimes called the Loudness Wars: the average levels of commercial music have been getting higher and higher, until there’s little dynamic range left. This phenomenon is partly the result of developments in audio processing technology, but also personal preferences – there’s a human tendency to perceive louder music as sounding “better.” But digital processing that can manipulate dynamic range, and other audio mixing techniques can be over-used, making audio louder, but lifeless and fatiguing to listen to.
If you mix so average levels are at –24 LUFS (Loudness Units relative to Full Scale) there should be enough dynamic range to sound natural, yet enough loudness to be full and present. That is the standard recommended by the Public Radio Satellite System and PRX, so files uploaded to those distributors should be mixed so that average levels are at that level. There’s an additional twist to that standard: peaks should not exceed –3 dBfs (Decibels relative to Full Scale.) This is similar to mixing with average levels a bit higher and allowing peaks to go to zero, but it’s not quite the same: it’s good practice to leave some open headroom, and not let peaks go all the way to zero. This way, further manipulations of the file, which may happen if a radio station’s automation system or broadcast chain applies additional processing, like a level adjustment, or EQ or compression, won’t cause the signal to distort. A peak that goes completely full-scale, hitting 0 dBfs might sound fine in the original file, but could clip and distort if any kind of processing is applied, even something that reduces volume, like a high-pass filter or other EQ.
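The arithmetic behind hitting both targets at once can be sketched in a few lines of Python. The function name plan_gain and the example measurements are hypothetical. The sketch assumes only that a simple gain change moves the loudness reading and the peak reading by the same number of dB.

```python
def plan_gain(measured_loudness, measured_peak, target_loudness=-24.0, peak_ceiling=-3.0):
    """Return the gain change needed to hit the target, and any peak overshoot.

    Loudness is in LUFS and peaks are in dBfs. A static gain change moves
    both readings by the same amount.
    """
    gain = target_loudness - measured_loudness
    new_peak = measured_peak + gain
    overshoot = max(0.0, new_peak - peak_ceiling)
    return gain, new_peak, overshoot


# Hypothetical measurements of a quiet voice track: -30 LUFS, peaks at -6 dBfs.
gain, new_peak, overshoot = plan_gain(-30.0, -6.0)
print(gain, new_peak, overshoot)  # 6.0 0.0 3.0
```

In this made-up case, raising the track 6 dB reaches the loudness target but pushes peaks 3 dB past the ceiling, so the peaks would need about 3 dB of taming with automation or a limiter.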
PRX recommends following the same level standards that the ContentDepot uses, and for good reason: the idea is for radio programs to have a consistent level no matter where they come from. A station can just drop the files from ContentDepot and PRX into their playback or automation computer and they’ll all play out at a similar loudness, without jarring, abrupt jumps up or down in volume.
These suggested broadcast levels align vaguely with another set of standards proposed several years ago: the K-System, developed by famed mastering engineer Bob Katz.
His main motivation was fighting back against the loudness wars in commercial music. Recordings were getting louder and louder, but sounding worse and worse. He developed the K-System, based somewhat on standard practices in the film industry, to encourage mix engineers and mastering engineers to resist the urge to go louder and louder, but instead to retain dynamic range, for a more natural sound, and to adhere to standard average levels. The K-System acknowledges that different contexts may call for different standards. It recommends average levels at –20 dBfs for high-fidelity music, -14 dBfs for pop music, and –12 dBfs for broadcast.
This system has not been widely adopted, but his explanations of the different standards, and advocacy for a set monitoring volume, are very helpful in understanding, and implementing a thoughtful approach to mixing.
The PRSS/PRX standard of -24 LUFS is not too far off the K-20 standard in the K-System, and the K-14 standard is similar to current recommendations for podcasting.
But there are still many other conflicting standards, each with a solid rationale for their numbers, and some with different ways of measuring the levels.
In the European Union, the level standards have solidified around the European Broadcasting Union’s “Recommendation R 128” (sometimes called EBU R128), which sets a target loudness value of –23 LUFS. The BBC used to use a PPM scale, but has adopted the EBU R128 standard.
Technical personnel from Public Radio stations around the US met, and, just to make things interesting, decided on a slightly different standard from the EBU: -24 LUFS. The two standards are so close as to be almost indistinguishable, so using either of them is likely to be good enough for most purposes.
There is no governing body for the technical specifications of podcasts, but there is a strong push to adopt either -18 or -16 LUFS for average levels.
So, how do you get your audio to those specs? It’s partly a matter of having good level meters, and knowing how to use them. The K-System, referenced above, also includes the idea of maintaining a fixed listening level of 83 dB(SPL). Even if you don’t adhere to that specific standard, keeping a fixed monitoring level will go a long way toward making your mixes more even, simply by using your ears. Try to avoid turning the volume of your speakers up and down randomly: have an unchanging setting for your amplifier, or mixer, even your headphones, or whatever you use to listen when mixing. Ideally, a volume control with a very precise knob using stepped gradations will allow you to more accurately create a repeatable monitoring level, but that’s a pretty esoteric requirement. Marking the position of all relevant knobs and/or faders with tape, or a marker, should get you close enough.
Meters are important aids, but remember, what we’re going for is an even loudness over time, but the essential nature of sound is that it’s changing constantly. So, use the meters as a guide, to verify what you’re hearing, but use your ears too. There are some hard-to-quantify characteristics of sound that will make some elements feel louder or softer than meters may indicate. On the other hand, ear fatigue can cause you to gradually increase, or decrease, the levels of a mix, so pay attention to your meters too.
Ideally, a 1 kHz sine wave tone, played from within your editing program, should make your meters read –24 LUFS on a loudness meter, and 0 VU on a VU-style meter. Pink noise, registering those same levels on your meters, should show 83 dB (SPL) on a sound level meter placed where you’ll be sitting while mixing. You can use a hardware SPL meter, or there are inexpensive apps for the iPhone and Android smartphones that do the same thing.
For metering within your digital workstation, there are many choices. For Pro Tools, it’s good practice to create a master fader, and insert a good loudness meter as a plug-in on that master track. There are good suggestions for Loudness Meters that can be used with Pro Tools and other programs at the end of the Loudness article on Transom.org.
If you’re able to spend some serious money, the Dorrough Meters from Waves are amazing. Dorrough hardware meters have been standard in the broadcast industry for years, and with good reason. They’re easy to read, and can display a wide range of helpful data. But the plugins are also pretty expensive, street price about $200 at press time (but often on sale for much less.) Warning: these meters do not display a standard Loudness value in LUFS, so they may not be helpful in matching a specific target value in Loudness Units, but they can be very helpful in maintaining a stable, consistent level.
Some programs have very good meters built in. Adobe Audition’s meters can be made very large, with fine detail. Hindenburg Journalist’s standard meter is very good, and Hindenburg even has a Loudness meter that shows precise values over time and can be inserted on any track.
Audition and Hindenburg can also perform Loudness Normalization, which allows you to adjust the average loudness of a clip to precise predefined levels.
It’s important to use your ears to verify that things are sounding right, but meters are important aids too. If you use a reliable meter, it can show you whether your clips have been turned up too loud: if so, you’ll see the peak meters hit the top of the scale and light up the red indicators. If this happens, you need to turn down that track, or control the peaks with a dynamics processor.
But if your meters are showing that your average levels are too low, but you have loud peaks that are preventing you from increasing the levels, you may need to use volume automation, or insert a dynamics processor, such as a compressor or limiter. In fact it’s often necessary to use some compression or limiting to get an unaccompanied spoken voice up to a desired level.
Understanding your meters is not as simple as you might hope. Measuring sound is a complex task – there are many different aspects of it that can be quantified (electrical power, sound pressure levels, relative intensity and more). Those different characteristics of sound are all interrelated, and although the units of each measurement can be expressed in “decibels” (dB), they’re actually different values, depending on what kind of dBs are being measured. It’s a hard concept to visualize, but decibels are relative, rather than absolute, values: they are always a comparison to some standard. It’s dB relative to something.
Just to make things even more odd, decibels use a logarithmic scale, so that a relatively manageable range of numbers can represent a very wide range of intensities. If decibels were linear units, like inches or liters, we’d have to resort to cumbersomely large numbers in order to describe the full range of sound intensities that we encounter.
There are two practical consequences of these quirks of measurement: a dB is not a simple unit with a standard value like a mile or a gram, and the loudness represented by each dB gets larger and larger as the values increase. The difference in intensity changes dramatically with even small movement along the scale.
The dBs that the general public might be familiar with are properly labeled as dB(SPL). SPL stands for Sound Pressure Level, and this scale is used for the loudness of audio phenomena in the environment, as perceived by human ears. In this case the ratio is based on the quietest perceivable sound having a value of 0 dB(SPL) and the intensity increases in a complex way. A very rough rule of thumb is that the perceived loudness doubles every 10dB. (You’ll find different values for that rate of doubling when calculating voltages, or power, or sound pressure – but the lesson to take from it is that small changes in the numbers of dBs of any type can represent large changes in perceived volume.)
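For readers who like to see the numbers, here is a short Python sketch of the logarithmic arithmetic. The function names are invented for this example. The two conversion formulas are the standard decibel definitions for amplitude-type and power-type quantities, and the last function simply encodes the rough doubling-every-10-dB rule of thumb mentioned above.

```python
import math


def amplitude_ratio_to_db(ratio):
    """dB change for a ratio of amplitudes (voltage, sound pressure, sample values)."""
    return 20 * math.log10(ratio)


def power_ratio_to_db(ratio):
    """dB change for a ratio of powers or intensities."""
    return 10 * math.log10(ratio)


def perceived_loudness_factor(db_change):
    """Rule of thumb only: perceived loudness roughly doubles every 10 dB."""
    return 2 ** (db_change / 10)


print(round(amplitude_ratio_to_db(2), 2))  # doubling the voltage: about 6 dB
print(round(power_ratio_to_db(2), 2))      # doubling the power: about 3 dB
print(perceived_loudness_factor(20))       # 20 dB louder: roughly 4 times as loud
```

This is why the rate of doubling differs depending on what is being measured: the same physical change is about 6 dB when described as voltage and about 3 dB when described as power.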
The perception of loudness is more complex than can be represented with a simple number, but it’s generally accepted that 40-60 dB(SPL) is the value for conversational speech, 80-90 dB(SPL) for loud traffic, and 130-140 dB(SPL) for a loud rock concert, which is also considered the threshold of pain.
As mentioned above, the K System uses the “magic” value of 83 dB(SPL) as a monitoring level, and that has become something of a standard in mixing and mastering studios. If you align your listening environment so that your reference level, the average loudness of your sound, is always 83 dB(SPL), you can often get very consistent levels mostly just by using your ears. But even having a stable monitor volume is not always reliable on its own: your perception of loudness can be altered by many factors, like fatigue, health, the monitoring environment, extraneous noise, and other variables, so be sure to also check your meters.
Conversely, meters are not completely reliable: human perception of sound is very complex, and does not react in the even, linear way that a meter does. The duration of a sound will affect its perceived loudness, as will the shape and complexity of the waveform, its pitch, and several other attributes.
Many attempts have been made to tailor the scale to represent the ways the human ear perceives the sound, and you may see sound levels described as A-Weighted, or any of several other curves that have come into use for specific purposes, all compensating for the non-linearity of the human perception of sound. Just to make things even more complicated: human hearing is by definition about perceiving changes in SPL, not a static value, so all of these measurements are also made over time.
Loudness metering, using Loudness Units, attempts to include more of those perceptual elements, and more closely reflect the experience of human hearing. Loudness units don’t correlate simply with decibels, but they work in a similar way, and you’ll still see decibels being used for some measurements of sound, even as Loudness Units become more common.
In order to fully understand your meters, you need to have at least a passing familiarity with the scale they are using. Whether they’re marked in dB, or LU, the scales we encounter most often in the modern audio recording world use units relative to full-scale. Those are the numbers used on the digital meters we most commonly see on recorders and in computer software. In this system, full-scale, the highest intensity of signal that can be encoded, is represented by the number zero, and all other levels are indicated by negative numbers: how many dB the signal is below that full-scale. As a signal gets more intense, its dBfs value gets closer to zero: -2dBfs is louder than –5dBfs, which is louder than –10dBfs, and the same is true for LUFS.
Older equipment often featured VU meters; in most cases they were mechanical needles pivoting across an arched scale. This scale did not use zero to represent the absolute maximum level. That zero does not mean the same thing as the full-scale zero: this zero indicates a value calibrated to a standard output voltage. Using this meter, it was possible, even desirable, to exceed the zero value at times; peaks above 0VU were fine.
Both kinds of metering are useful in their own ways, and each is better suited for measuring different aspects of an audio signal. The trick is that it’s very hard to represent sound on a meter: the very nature of sound is that it’s the CHANGE in sound intensity and pressure over time, and there’s a very complex relationship between the physical properties of the sound itself and the way our ears perceive it. So any kind of metering is an approximate representation of what we want to know. If one learns to use them, meters can be very helpful in getting clean, predictable, evenly-balanced sound, but in the end, trust your ears…
Recording in the Field
The primary task of field recording is to record as loud as is practical, without ever clipping, which is to say, hitting 0 dBfs, often indicated by a red light. The meters on various recorders behave differently, so it may take a little while to get used to properly interpreting what those meters are telling you.
The excellent Sound Devices recorders have very accurate and readable meters, but the way they are designed, desirable, safe, recording levels can cause the meters to flash red. You just have to get used to this quirk, and realize that in this rare case, seeing red indicators on the meters is OK, although you still need to avoid lighting up that very top LED light.
Seeing red lights, or other clipping indicators on a meter is usually not good: the generally accepted convention is for red lights to indicate clipping, but any given meter may indicate clips in other ways. Be sure to learn how your recorder displays clips, and avoid them! Some recorders even have clip indicators separate from the regular level meter. If input levels are high enough to be triggering those clip lights, the first thing to do is to turn down the input gain. If the source is very loud, you may need to engage a “pad” on either the recorder or the microphone, which will knock the level of the incoming signal down by a preset amount.
The tricky part is to not turn your inputs down too low. If you record at too low a level, you risk creating a noisy recording. Sounds recorded at too low a level will need to be boosted when mixing, and increasing the level of the field recording when mixing will also boost any residual noise from the recorder and microphone. If you record extremely low levels, your sound may also be muddy and indistinct, because you haven’t given the recorder enough signal to convert to useful digital information. It’s like under-exposing a digital photo: you can brighten it back up in an image editing program, but the picture will look grainy and blocky because you used so little of the sensor’s range to encode the data. The same holds true for audio: raising the gain dramatically later in your computer will reveal noise and other imperfections of the recording chain.
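A little arithmetic shows why boosting later cannot rescue a too-quiet recording. This Python sketch uses an invented function name and a hypothetical noise floor of -80 dBfs, and it assumes the noise from the recorder and microphone stays at a fixed level no matter how hot the signal was recorded.

```python
def after_boost(signal_level, noise_floor, target_level):
    """Raise a recording to target_level and report where the noise ends up.

    Levels are in dBfs. Gain applied in the mix raises signal and noise equally,
    so the gap between them (the signal-to-noise ratio) does not improve.
    """
    gain = target_level - signal_level
    return gain, noise_floor + gain


NOISE_FLOOR = -80.0  # hypothetical self-noise of a recorder and microphone, in dBfs

# The same voice recorded at a healthy level and at a very low level,
# both brought up to the same level in the mix.
print(after_boost(-18.0, NOISE_FLOOR, -12.0))  # (6.0, -74.0)
print(after_boost(-48.0, NOISE_FLOOR, -12.0))  # (36.0, -44.0)
```

Under these assumptions the under-recorded take ends up with its noise 30 dB higher in the final mix, which is the hiss you hear when a quiet recording is cranked up.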
A common rookie mistake is to turn the input volume down when hearing background noise during recording. That noise is going to reappear if you have to raise the gain when you mix, so it’s better to try to mitigate that while recording, by getting closer to your subject, or moving away from the source of the noise, or trying a different microphone, or adjusting the settings on your recorder, such as engaging a high-pass filter.
(There’s one exception to that rule: the microphone preamps on some recorders are slightly more noisy when turned ALL the way up to their absolute maximum. You may need to experiment with your particular combination of microphone and recorder and see if there’s added noise when you set the input knob up at 9 or 10.)
As a rule, record as loud as you can, without clipping. Watch your meters carefully, and be ready to adjust the input gain (gently, gradually!) on the fly if your meters are reading too high or too low. It’s better to err on the low side than the high: quiet levels are easier to fix in the mix than those with distorted peaks, but recordings made at TOO low a volume can also be problematic.
Many recorders offer Automatic Gain Control (AGC) and/or Limiting, which can adjust the levels automatically, but these vary widely in quality. With very few exceptions, AGC will create an unnatural-sounding recording, with background sounds pumping up and down in an unpleasant way. It’s also important to verify how the AGC works: some Zoom recorders, for instance, do not adjust the level over time, instead setting a fixed recording level based on the signal intensity present when placing the machine in record-ready mode. This avoids the pumping sounds of conventional AGC, but it also does not react to changing audio levels, and may therefore still allow distorted or too-low recordings.
Built-in limiters also range widely in their sound quality and effectiveness. In theory, a limiter can tame unexpected peaks by automatically reducing the input gain on signals that exceed a certain threshold level. Limiters tend to be more specific in their action than AGC circuits, and so can sound quite good. They can be a lifesaver when encountering unexpected loud sounds, or troublesome sources that have excessively large differences between loud and soft components. But you may need to experiment with your recorder in order to decide whether its limiter sounds good or not. Many do not sound great. Sony and Sound Devices field recorders have limiters that manage to sound very natural, but many recorders have slow-acting limiters that tend to overshoot, and take too long to return to normal record levels, creating odd-sounding volume dips on the recording following a loud peak.
Ideally, one would record without AGC or limiting, and even-out the levels in the more controlled environment of a digital workstation. But if avoiding clipping means that you’d be forced to record very low signals, you may need to engage one of those tools, or carefully ride the input levels by manually adjusting the gain knob.
In the Mix
At the mix stage, the engineer’s relationship with levels is completely different from recording in the field. Yes, one still needs to avoid clipping, so that your final mix doesn’t distort, but you’re less concerned with getting your levels as loud as possible, and more concerned with making them even. This is all about getting the average levels even, not the peak levels, and getting those average levels to match accepted standards.
As mentioned above, the expected practice in the Public Radio system in the US is to mix your session so that average levels are at –24 LUFS and peaks do not exceed –3 dBfs. You achieve that by adjusting the levels of each clip in your project, so that your audio level meters, inserted on the main output, show those levels.
The volume of the individual clips can be adjusted in three main ways: by adjusting the level of the clips themselves, by doing volume adjustments to the track by riding the fader or writing volume automation, or by using dynamic processors, like compressors and limiters, either on the track or applying the effect to the region.
Many people start by “normalizing” individual regions or clips. This can be helpful, or counterproductive, depending on the nature of the recordings. If the audio is extremely quiet, performing a process like this can be a handy way to get each clip close to what you’re looking for. The problem is that most Normalizing processes look only at peak levels, and adjust the region so that its loudest point is up at 0 dBfs. That output level can sometimes be adjusted, but keep in mind that most normalizing is done based on peaks.
But peak levels don’t correlate well with how loud the clip sounds; you need to look at average (RMS) levels, or Loudness, for that. There are some RMS or Loudness normalizing utilities out there, but most of the time when you see “normalizing” it refers to Peak normalizing. Loudness normalizing will get you much closer to even volumes across clips. The downside of it is that the process could do something bad to the sounds’ peaks; there’s even the chance that you could force a peak to clip and distort. So you need to watch both the peaks and the average levels, as you change a region’s volume.
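The trade-off between the two kinds of normalizing can be seen in a small Python sketch. Everything here is illustrative: the clips are synthetic, the function names are invented, samples are floats with 1.0 as full scale, and plain RMS stands in for a true Loudness measurement.

```python
import math


def to_db(value):
    return 20 * math.log10(value) if value > 0 else float("-inf")


def peak_db(samples):
    return to_db(max(abs(s) for s in samples))


def rms_db(samples):
    return to_db(math.sqrt(sum(s * s for s in samples) / len(samples)))


def apply_gain(samples, gain_db):
    factor = 10 ** (gain_db / 20)
    return [s * factor for s in samples]


def normalize_peak(samples, target_db=0.0):
    """Scale the clip so its loudest sample lands on target_db."""
    return apply_gain(samples, target_db - peak_db(samples))


def normalize_rms(samples, target_db):
    """Scale the clip so its average (RMS) level lands on target_db."""
    return apply_gain(samples, target_db - rms_db(samples))


# Two hypothetical clips: a steady tone, and a quiet tone with one loud spike.
steady = [0.25 * math.sin(2 * math.pi * i / 100) for i in range(10000)]
spiky = [0.05 * math.sin(2 * math.pi * i / 100) for i in range(10000)]
spiky[5000] = 0.9

# Peak normalizing gives both clips the same peak but very different averages.
for clip in (steady, spiky):
    out = normalize_peak(clip, -3.0)
    print(round(peak_db(out), 1), round(rms_db(out), 1))

# Average normalizing matches the averages, but the spike now exceeds full scale.
for clip in (steady, spiky):
    out = normalize_rms(clip, -20.0)
    print(round(peak_db(out), 1), round(rms_db(out), 1))
```

In the second pass the spiky clip reports a peak above 0 dB, which in a fixed-point file would clip. That is the case where you need to tame the peak first, or accept a lower average.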
Another problem with normalizing is that, in many programs, that process writes a new file with the new volume, which eats up disk space, and also makes it harder to go back and change your mind about where a region begins or ends. Not all programs write a new file when normalizing, but most do, including Pro Tools, Audition and Audacity.
One of the attractive things about Hindenburg Journalist is that its leveling functions, including auto-level, do not write new files; the adjustment is completely non-destructive. The amount of boost or cut can be tweaked as much as desired, and the boundaries of the volume adjustments can be moved as well. You can even apply, and adjust, crossfades across the borders of regions with different volumes. More important, the auto-leveling is based on Loudness.
If the levels of individual clips are fairly well recorded, therefore close to the right level to start, it’s probably better to skip normalizing or other kinds of gain applied directly to the individual regions, and instead use volume automation and/or compression.
In most mixes, you’ll still want to do some volume automation on some tracks to compensate for momentary loud or soft sections. It’s best to adjust the levels of individual tracks, not the final mixed output. Each track may need unique volume changes, which may interact with one another as each change is made, so listen carefully after you make a change, and watch your meters: make sure that the volume changes you are making are resulting in a healthy level at the final mix, but are not causing clips and distortion. This is especially likely to be a concern if you are layering sounds; each element on its own might be fine, but two or three tracks playing together may create an overload on the stereo master track.
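The way layered tracks add up can be estimated with a few lines of Python. This is a worst-case sketch with invented function names: it assumes every track hits its peak at the same instant with the same polarity, which real program material rarely does, so actual mixes usually land somewhat lower.

```python
import math


def db_to_linear(db):
    return 10 ** (db / 20)


def linear_to_db(value):
    return 20 * math.log10(value)


def worst_case_sum_db(*track_peaks_db):
    """Peak level on the master if all track peaks coincide exactly."""
    return linear_to_db(sum(db_to_linear(p) for p in track_peaks_db))


# Two tracks that each peak at a safe -6 dBfs can touch full scale together.
print(round(worst_case_sum_db(-6.0, -6.0), 1))
# Three such tracks can overshoot the master by several dB.
print(round(worst_case_sum_db(-6.0, -6.0, -6.0), 1))
```

Decibel values cannot simply be added together; the levels have to be converted back to linear amplitudes, summed, and converted again, which is what the meter on the master track is effectively showing you.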
Automating the output volume of a track is a straight-ahead process in most editing programs: there’s usually a graphical line one can adjust when in volume mode, or perhaps the upper edge of the waveform image can be dragged up or down, to raise and lower the volume. The visual depiction of the waveform can be helpful in guiding you to the spots that need boosting or cutting, but be sure to use your ears, more than your eyes. Some variation of the levels is natural and desirable so don’t over-smooth it. If you do ramp the volume up and down, be sure to be gradual; in most cases, abrupt changes to a track’s volume will sound unnatural. If a track sounds like it needs constant riding of the levels up and down, it’s a good candidate for dynamics processing.
The third way of evening-out the audio level is to reduce the dynamic range of the audio by using compression and limiting. Getting your peaks at the right level does not guarantee that your average levels will be correct. Of course the converse is true as well: if you mix so that your average levels are correct, you may find that peaks are too high, perhaps even clipping. If that’s happening to your mix you may be able to address it with volume automation, but the more practical answer is often to use compression and/or limiting.
It’s important to note that the term “compression” is used in different ways in the audio world. One usage refers to data compression; reducing the data-size of a digital audio file by converting it to an MP3, or AAC or OGG, or other such file types, in order to make it easier to store or transmit.
But we’re talking about the other kind of compression: audio level compression, or dynamic range reduction. Compression and Limiting are just two flavors of the same process: Limiting is just extreme compression, usually applied only to the loudest parts of a sound. Both types of processing reduce the levels of loud sounds, while leaving the quieter sections alone. The result is a more-manageable dynamic range, with reduced peak levels, and less difference between the loud parts and the quiet parts. With the peaks reduced, the overall level can be increased, making the average levels louder. Some compressors and limiters do that automatically – that process is sometimes called Upward Compression. If the compressor you’re using does not do that automatically, you may need to adjust the parameter on the compressor called “make-up gain” in order to return the processed track to an ideal average level.
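The basic behavior of a compressor can be sketched as a simple input-to-output curve. This Python example is a static model only, with an invented function name and arbitrary default settings; real compressors also have attack and release times that shape how quickly the gain changes, which this sketch ignores.

```python
def compress_db(level_db, threshold_db=-18.0, ratio=4.0, makeup_db=0.0):
    """Static compressor curve: output level for a given input level, in dB.

    Below the threshold the level passes unchanged. Above it, every `ratio` dB
    of input produces only 1 dB of additional output. Make-up gain is then
    added to everything.
    """
    if level_db > threshold_db:
        level_db = threshold_db + (level_db - threshold_db) / ratio
    return level_db + makeup_db


# Quiet, at-threshold, and loud input levels, without and with 6 dB of make-up gain.
for level in (-30.0, -18.0, -6.0):
    print(level, compress_db(level), compress_db(level, makeup_db=6.0))
```

With these settings a 24 dB spread between the quiet and loud inputs shrinks to 15 dB, and the make-up gain then lifts the whole track. Setting the ratio very high turns the same curve into a limiter.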
In general, you’ll get better results by applying compression to individual tracks, rather than the final stereo master track, although it’s often advisable to do both. In fact, it’s often very handy to put some gentle compression, and then a brick-wall limiter, such as the Waves L1, or the Maxim, or LoudMax, on the stereo master track, just as a safety, to catch any stray peaks. I always have a limiter plug-in on my master fader, with the output level set to –3 dBfs, so that no matter what I do, no peaks will exceed that level.
If my average levels are too low, I’ll lower the threshold of the limiter, the level at which it starts to take effect. That has the effect of increasing the overall average mix level, while still holding the peaks at –3 dBfs. The limiters I mentioned above all have automatic gain compensation, so as the action of the limiter increases, the average level of the program material increases. Many traditional compressors and limiters work the other way: the more compression or limiting is applied, the lower the overall output levels. Those compressors almost always include a control for make-up gain, allowing the overall signal volume to be raised to compensate for the reduced peak levels.
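The arithmetic behind lowering the threshold can be sketched in a few lines of Python. This is a simplified model that works on levels in dB rather than on audio, and it assumes the limiter's gain compensation raises everything by the distance between the threshold and the output ceiling. The function name `limiter_output_db` and the example levels are hypothetical, and real plug-ins may behave differently in detail.

```python
def limiter_output_db(input_db, threshold_db, ceiling_db):
    """Output level for a given input level, in a simplified limiter model.

    Assumption: automatic gain compensation boosts the signal so that the
    threshold lands exactly on the ceiling, and nothing passes the ceiling.
    """
    boosted = input_db + (ceiling_db - threshold_db)
    return min(boosted, ceiling_db)

ceiling = -3     # output level set on the limiter, in dBfs
threshold = -9   # lowered 6 dB below the ceiling

quiet_passage = limiter_output_db(-30, threshold, ceiling)  # -24: raised by 6 dB
loud_peak = limiter_output_db(-5, threshold, ceiling)       # -3: held at the ceiling
```

In this model, every 1 dB the threshold comes down adds 1 dB to everything below it, while the peaks stay pinned at –3 dBfs. That is why the average level rises as the limiter works harder.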
Compressing or limiting the final master track may not be enough, or may introduce undesirable audible artifacts, like the volume of music or ambience backgrounds jumping up and down. In many cases, it’s best to compress or limit individual tracks, so that only those elements are affected by the processing.
Voice tracks, in particular, can often benefit from some compression, just to even-out the natural tendency of the spoken word to have a wide range of volumes.
Gregg McVicar wrote a good column about adjusting voice levels for Transom.org several years ago that still offers good advice.
Getting good results with a compressor takes some practice, but in general, you activate the compressor, play some audio into it, and then adjust the ratio and threshold controls until you start to see some gain reduction on loud peaks. I prefer to insert a compressor as a plug-in on a track and let it affect the whole track.
If you’re inserting compression on a track, it makes sense to place different voices on individual tracks, and use a compressor on each track, if needed, possibly with a different setting for each.
Looking more closely at the compressor plug-in: the ratio control adjusts the severity with which the compressor acts. A 3:1 ratio means that for every 3 dB the original sound level increases above the threshold, the compressor will only allow the output to increase by 1 dB. A ratio of 3:1 or 4:1 is usually a good starting point; it’s usually gentle enough that the gain reduction will not be noticeable. Most compressors have a meter that indicates gain reduction, so watch that meter, and turn the threshold value down until you start to see gain reduction of a few dB. Taking 3 to 4 dB off the peaks is a good rule of thumb for gentle compression, and will go a long way toward evening-out the typical spoken voice. There are no strict rules about how much compression is proper; the best thing to do is to listen. Too much compression will make the sound levels pump up and down, or accentuate low-level sounds and breaths in an unnatural way. If it starts sounding weird, raise the threshold setting until it sounds better.
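The ratio and threshold arithmetic is easy to check with a short Python sketch. It models only the basic input-to-output relationship of a compressor, ignoring attack, release and knee settings, and the function names and the –18 dB threshold are assumptions for the sake of the example.

```python
def compressed_level_db(input_db, threshold_db, ratio):
    """Output level of a simple compressor for a given input level, in dB."""
    if input_db <= threshold_db:
        # Below the threshold the compressor leaves the signal alone.
        return input_db
    # Above it, every `ratio` dB of input yields only 1 dB of output.
    return threshold_db + (input_db - threshold_db) / ratio

def gain_reduction_db(input_db, threshold_db, ratio):
    """How many dB the compressor is taking off, as shown on its meter."""
    return input_db - compressed_level_db(input_db, threshold_db, ratio)

threshold = -18.0
ratio = 3.0

loud = compressed_level_db(-12.0, threshold, ratio)   # 6 dB over -> only 2 dB over: -16.0
reduction = gain_reduction_db(-12.0, threshold, ratio)  # 4.0 dB of gain reduction
soft = compressed_level_db(-24.0, threshold, ratio)   # below threshold: unchanged, -24.0
```

With these settings a peak 6 dB over the threshold shows 4 dB on the gain-reduction meter, which is right in the gentle range described above. Lowering the threshold puts more of the signal over the line, so the meter moves more often and further.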
Limiting is simply compression using a very high ratio, perhaps 10:1 or higher. This is too severe to use at low thresholds, but when applied to the very top of the audio range, to treat the very highest peaks, a limiter can control short transient spikes without sounding unnatural. Percussive sounds, an explosively-loud laugh, yelling, and other short sounds that are much louder than the surrounding audio can often be brought under control with careful limiting.
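To show what controlling a short spike looks like at the sample level, here is a bare-bones peak limiter in Python. It is a sketch, not a model of any plug-in: the function name `limit_peaks`, the instant attack, the per-sample `release` value, and the sample data are all assumptions, and real limiters are considerably more refined.

```python
def limit_peaks(samples, ceiling=0.7, release=0.5):
    """Hold every sample at or below the ceiling (linear amplitude, 1.0 = full scale).

    Gain drops instantly when a peak arrives, then recovers gradually.
    `release` (between 0 and 1) is the share of the remaining gain reduction
    kept from one sample to the next; higher values mean slower recovery.
    """
    gain = 1.0
    out = []
    for s in samples:
        level = abs(s)
        # Gain that would put this sample exactly at the ceiling, if it is over.
        needed = ceiling / level if level > ceiling else 1.0
        if needed < gain:
            gain = needed  # instant attack: clamp down right away
        else:
            gain = needed + release * (gain - needed)  # ease back toward normal
        out.append(s * gain)
    return out

# Hypothetical voice samples with one explosive spike (1.0) in the middle.
voice = [0.2, -0.3, 0.25, 1.0, -0.3, 0.2, -0.25, 0.3]
limited = limit_peaks(voice)  # ceiling of 0.7 is roughly -3 dBfs
```

The samples before the spike pass through untouched, the spike itself is pulled down to the ceiling, and the samples just after it are slightly reduced while the gain recovers. At real sample rates the release value would be much closer to 1, so the recovery takes many thousands of samples rather than a handful.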
Normalizing, automating levels, compressing and limiting, checking meters to see if the levels are hitting your targets…it’s a lot of work! If only someone could develop an application to do all of that automatically…Well, someone did. There is a tool called The Levelator. It is no longer being developed or supported, so it may not work with your computer or your operating system, but you can always give it a try.
It’s free to download and use. The Levelator only works on .wav or .aiff files, and has been designed to work best on voice. It usually does NOT work well on a mixed piece with music or ambience, but it can work magic on an unaccompanied voice or series of voices. It uses a complicated array of processes that somehow create a very even output loudness from an original source with widely divergent levels. For better or worse, it’s dead simple to use: there are no controls, just drop your .wav or .aiff file on its icon and let it run. I’ve had some recordings come out sounding over-processed and unnatural, but most come out very clean. This is an especially good tool for podcasters, particularly those who integrate several voices.
But – as I mentioned, this program is no longer supported, so it might not work for you. Luckily, similar processing is available from a company called Auphonic. It offers both free and paid services that can adjust your audio levels, clean up noise and more. It’s all automatic, so you don’t have much control over what happens, but it can often do a good job of evening-out your audio levels if you don’t have the time to make lots of manual adjustments, or if you just aren’t comfortable working with compressors and limiters.
In a different way, the editing software called Hindenburg Journalist can achieve a similar result. The program has good meters that display loudness, and features the additional attribute of calculating the average level of each sound clip that’s imported into a track, and adjusting it automatically so that it meets a predetermined level standard. The program does a remarkably good job of making each clip have the same apparent volume as the others. What is usually a tedious task of adjusting each clip so that it bounces the meters in the same way is done automatically for you. Of course you may still need to make some tweaks here and there, but for the most part, a lot of the rough mixing is done by the program. Clips are automatically leveled when imported into the workspace, and the levels can be recalculated manually after editing.
Hindenburg Journalist also features a “Voice Profiler” which can automatically apply compression and EQ to a voice track, based on an analysis of the contents of that track. The program also includes a very simple, but great-sounding, compressor plug-in that can be used on any track, including a master track.
Additionally, Hindenburg offers loudness normalizing upon export. Individual clips, or complete mixes, can be exported from the program and individually adjusted so that the average levels are precisely -24, -23 or -16 LUFS.
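The idea behind loudness normalizing is simple enough to sketch in Python. This is not Hindenburg's actual algorithm. It assumes you already have a measured loudness and peak reading from a meter, and that a plain gain change moves both readings by the same number of dB. The function names and the example readings are hypothetical.

```python
def normalization_gain_db(measured_lufs, target_lufs):
    """Gain change, in dB, that moves the measured loudness onto the target."""
    return target_lufs - measured_lufs

def peak_after_gain_dbfs(peak_dbfs, gain_db):
    """A plain gain change shifts the peak level by the same amount."""
    return peak_dbfs + gain_db

TARGET_LUFS = -24.0   # loudness target mentioned in this article
PEAK_LIMIT_DBFS = -3.0

# A mix that came out too hot: turn it down.
gain_hot = normalization_gain_db(-19.5, TARGET_LUFS)   # -4.5 dB
peak_hot = peak_after_gain_dbfs(-1.0, gain_hot)        # -5.5 dBfs, under the limit

# A mix that came out too quiet: turn it up, then check the peaks.
gain_quiet = normalization_gain_db(-30.0, TARGET_LUFS)  # +6.0 dB
peak_quiet = peak_after_gain_dbfs(-6.0, gain_quiet)     # 0.0 dBfs, over the limit
needs_limiting = peak_quiet > PEAK_LIMIT_DBFS           # True
```

The second case is the one to watch for: bringing a quiet mix up to the target can push its peaks past –3 dBfs, so a limiter on the master track still has a job to do after normalizing.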
By using a good meter, and trusting your ears, and applying some of the techniques here, you can output a final mix that will sit well next to other professional productions. Stations will thank you, and listeners will thank you.
There are ongoing proposals for new standards for audio levels that extend from commercial broadcasting through the record industry and the film world, so there may be new expectations, and new tools to help meet them. The push is for greater dynamic range on recorded media, but that’s unlikely to penetrate too deeply into the broadcast world: too many people listen to radio and TV with lots of background noise, so the audience probably wouldn’t appreciate quieter voices and louder explosions, at least not on their radios.
The beautiful thing about standards is that you can have so many of them, so be sure to find out what audio levels are expected by the consumers of your productions. Perhaps one day there will be only one standard, but for now, it’s a bit unsettled. In the interim, remember: use a Loudness meter, and mix submissions to ContentDepot or PRX with average levels at –24 LUFS and peaks no higher than –3 dBfs.
Resources and References
What is a decibel?