Building a complete audio world in DaVinci Resolve Fairlight — five layers, foley, and the finished mix
Focus: sound design philosophy — how audio creates emotional reality independently of the image, and how professional mixes are architecturally structured.
The five layers of a professional mix(1) Dialogue — primary human speech, always the highest priority. (2) Ambience/room tone — gives the world continuity and presence. Without ambience, spaces feel dead. (3) Sound effects (SFX) — specific sounds tied to visual events. (4) Foley — custom-recorded sounds that replace or enhance real sounds (footsteps, clothing, object handling). (5) Music — score or licensed tracks that drive emotion and pacing. All five interact — managing their relative volume and frequency content is the art of mixing.
Contrast as sound design — silence, texture, dynamic rangeThe most powerful tool in sound design is not a sound — it is the absence of one. Silence after sustained noise creates shock and focus. A sudden switch from the loud, busy sound world of a city to near-silence underlines a character's isolation. Dynamic range in audio — the contrast between the quietest and loudest moments — is what makes both the quiet and the loud feel significant.
Diegetic vs non-diegetic soundDiegetic sound exists within the story world — the characters can hear it. A radio playing in a kitchen, a car horn, footsteps. Non-diegetic sound exists outside the story world — only the audience hears it. The score, a narrative voice-over, a sound motif. The blurring of this boundary — diegetic source music that gradually becomes non-diegetic score — is one of cinema's most sophisticated emotional transitions.
Spotting a film for soundBefore designing any sounds, spot the film: watch it through and note every moment that needs audio attention. Create a spotting list: timecode, description of what needs to happen, and whether it requires SFX, foley, or music. Common spots: hard sound effects (door opens at 00:34), foley opportunities (footsteps throughout scene 2), ambience changes (transition from interior to exterior at 01:23), music moments (theme enters at 02:00).
The Fairlight Sound LibraryResolve's Fairlight Sound Library (View → Show Sound Library) contains thousands of categorised SFX — ambiences, foley, vehicles, weather, human sounds, and more. Before purchasing external libraries, explore the built-in library thoroughly. For most Phase 1–3 projects it provides everything you need. External libraries (Soundly, Soundsnap) become valuable when you need very specific sounds for specialist work.
Drill 1
Spot a film — create a spotting list
Take your Phase 1 short film. Watch it through without stopping and create a complete spotting list: every moment that needs SFX, foley, ambience, or music. Use timecodes. Categorise every entry. Count: how many separate audio events does your 2-minute film require? Most beginner films have fewer spots than they should — add to the list places where absent sounds would add realism or emotion.
Drill 2
Ambience layer build
Take any scene from your footage that has primarily dialogue and minimal ambient sound. Build a complete ambience layer using Resolve's Fairlight Sound Library: find an interior ambience that matches the space (office, home, outdoors), place it at low level (-30 to -25 dBFS) under the entire scene. Then add smaller spatial events: distant traffic, a bird outside, an air conditioner hum. Compare the scene with and without the ambience layer.
Drill 3
SFX replacement exercise
Find a scene with clear physical sound events (a door opening, footsteps, an object being picked up). Mute all production audio on those events. Replace each with a sound from the Fairlight Sound Library or a custom-recorded sound. The replaced sounds should feel more present and clear than the production audio originals. Real recorded sounds on set are often less convincing than purpose-recorded SFX.
Drill 4
Silence as sound design
Take a 30-second clip with continuous ambience and dialogue. Create a new version where all non-dialogue sounds drop to complete silence for a 3–4 second period at a dramatically significant moment, then return. The silence should feel like a punctuation mark — emphasising something. Watch both versions and note whether the effect works, and why.
Week 1 Assignment
"Complete spotting list"
Produce a complete spotting list for your Phase 1 short film. The list must include: timecode, audio event description, category (SFX/Foley/Ambience/Music), priority (essential/enhancing), and a note on where you will source the sound. Minimum 20 entries for a 2-minute film — if you have fewer than 20, you are not listening closely enough.
Minimum 20 entries for a 2-minute film
Every major physical event in the film is spotted
Ambience transitions are identified and spotted
Music moments are clearly identified with emotional intent noted
DaVinci Resolve FairlightATH-M40X headphones
Focus: foley recording, SFX sourcing, and building a personal sound library from custom recordings.
Foley — the art of custom sound recordingFoley is the practice of recording sounds post-production to accompany picture: footsteps, clothing movement, object handling. In a one-person workflow, you record foley in your own space using whatever produces the right sound — it doesn't matter what the source object is, only that it sounds right in context. A crinkled plastic bag sounds like fire. Rice pouring sounds like rain. Foley is about creative problem-solving, not literal duplication.
Recording foley with the Rode NTGThe NTG's directional pickup pattern is ideal for foley — it rejects room ambience and focuses on the close, specific sound. Mount the NTG on a shock mount 15–30cm from the foley source. Record in your quietest space. Record each foley element separately: all footsteps in one pass, all door sounds in another. This makes cutting foley to picture much easier than mixed recording passes.
Building a personal SFX libraryEvery location you visit is an opportunity to record sounds for your library. Record: traffic from different perspectives (close, mid-distance, far), water in different situations, wind in different environments, crowds, birds in your region, machinery, doors in different buildings, footsteps on different surfaces. Organise by category: Ambience/Outdoor, Ambience/Indoor, Transport, Human/Foley, Nature, Mechanical. Label every file precisely.
Sound design for emotion — frequency and textureLow-frequency drones (20–80Hz) create tension and unease — often inaudible as a distinct sound but felt in the body. High-frequency sustained tones suggest anxiety or tinnitus. Warm, mid-range tones (800Hz–2kHz) feel human and present. Metallic, resonant sounds feel industrial and cold. A score that introduces a very low, barely audible drone before a threat appears tells the subconscious something is wrong before the conscious mind registers it.
Pitch and time manipulation of SFX in FairlightFairlight's clip attributes allow pitch shifting and time stretching. Pitch a door creak down an octave and it becomes a ship hull groaning. Pitch up rain and slow it down and it becomes something alien. Time stretch a heartbeat and it becomes a low-frequency percussion hit. These transformations are the foundation of abstract and science fiction sound design.
Drill 1
Record a foley session
Take your Phase 1 short film and record foley for every physical action in it: footsteps (walk in place on appropriate surfaces), door sounds, object handling, clothing movement. Record each category in a separate clean pass using the Rode NTG in your quietest space.
Drill 2
Location sound library recording
Visit three different locations in Adelaide over one session. At each, record: 3 minutes of background ambience, and 10 distinct sound events. Come back with at least 30 unique usable recordings. Organise and label everything before moving on. This session starts your personal sound library.
Drill 3
SFX pitch manipulation
Take 5 sounds from your library or Fairlight's Sound Library. In Fairlight's Inspector, pitch each down by 12 semitones (one octave) and listen. Then pitch up 12 semitones. Then apply 200% time stretch. Document what each sound becomes. A door creak becomes what? A bird call becomes what? A footstep becomes what?
Drill 4
Match foley to picture
Take a 30-second clip where someone walks across a room and picks up an object. Mute all production audio. Record foley for every footstep, the clothing movement as they bend, and the object being picked up. Cut the foley to match picture precisely — each footstep lands on the frame where the foot touches the floor. This sync-match skill is the core of foley work.
Week 2 Assignment
"The foley session"
Record and cut complete foley for a 60-second scene. Every physical action must have custom-recorded foley replacing the production sound. The foley must be in sync with picture. Deliver the scene with and without foley for comparison, plus a behind-the-scenes note describing what you used to create each sound.
Every physical action has foley coverage
Sync is accurate — sounds land on the correct frame
Foley is believable — it sounds like what it represents
BTS notes reveal creative problem-solving in sound-to-source choices
Focus: music licensing, editorial music, tempo matching, and the music-picture relationship.
Music licensing for professional useUsing commercial music without a licence in client work or public-facing professional deliverables creates copyright infringement liability. Options: royalty-free music libraries (Artlist, Musicbed, Epidemic Sound, Soundstripe — annual subscription models), direct licensing from artists, or original commissioned score. For YouTube: Content ID will claim your video. Never use commercial music in client deliverables without a specific synchronisation licence.
Selecting music — tempo, key, and emotional registerSelecting music for a film is not about finding a track you like — it is about finding a track that serves the picture. Consider: tempo (does the music's pulse complement the edit pace?), emotional register (does the music's character match the scene's emotional content?), key/mode (major = positive; minor = melancholic; modal = ambiguous), and dynamic range (does the music have quiet and loud moments that map to the film's structure?).
Editing music to lengthRoyalty-free tracks are rarely exactly the length you need. Techniques: trim at a phrase end (the end of a musical sentence, not mid-bar), fade out over the last 8–16 bars, loop a section by copy-pasting a phrase to extend the track, or cut from the intro directly to a later section. In Fairlight, use audio cross-fades at loop points to smooth the join. Pitch-corrected time stretch can extend or compress a track by up to 15–20% without audible artefacts.
Fairlight automation — music duckingWhen dialogue is present, music should duck to approximately -24 to -28 dBFS — much lower than it seems natural to set it. Use Fairlight's volume automation: click the automation icon on the music track, set to 'write,' play through the timeline while moving the fader, then set to 'read' for playback. Use curve handles on the automation keyframes to create gradual rather than sudden changes.
Score vs source musicScore (underscore) is non-diegetic music — it comments on the action and creates emotional context. Source music exists within the story world — a radio playing, a band performing. The choice between score and source has storytelling implications. Score places the audience outside the scene looking in. Source places them inside the scene — sharing the character's experience.
Drill 1
Same scene, three music tracks
Take a 90-second scene with no dialogue. Find three different music tracks with genuinely different emotional registers — warm and optimistic, tense or melancholic, energetic. Edit each under the same picture. Watch all three versions. Note specifically: where do the cuts feel different in timing? Does the scene's meaning change? What do you feel differently about the character in each version?
Drill 2
Edit a music track to length
Take a 3-minute music track and edit it to exactly 90 seconds. The result must sound like a complete piece — not a truncated version. Use phrase-end cuts, a crossfaded loop, and a fade out. The join points must be inaudible on first listen on ATH-M40X. This practical skill is used in virtually every professional video edit.
Drill 3
Fairlight automation — music ducking
Build a 2-minute timeline with dialogue and continuous music. Use Fairlight's volume automation to automate the music: full level during instrumental breaks, ducked 12–15 dB under dialogue. The automation should be smooth — use curve handles on the automation keyframes to create gradual rather than sudden changes.
Drill 4
Source to score transition
Design a sequence where diegetic music (a song playing from a speaker in frame) gradually transitions to non-diegetic score as the scene becomes emotionally significant. In Fairlight: start with the source music at realistic level and with room ambience on it. As the transition begins, gradually remove the room ambience and smoothly crossfade to a different version of the music without room ambience. The audience should not notice the exact moment the music left the story world.
Week 3 Assignment
"Music-driven short"
Produce a 90-second short piece where music is the primary emotional driver. The edit should be cut to the music's structure, and the music's emotional arc should match and enhance the visual story. Deliver with a note explaining: the music choice rationale, how you edited the music track to length, and how you managed the dialogue-music balance.
Music is correctly licensed for use (note the licence type)
Edit cuts reflect musical phrase structure, not just beat hits
Dialogue (if present) is clearly audible above the music at all times
Music has been edited to length seamlessly
DaVinci Resolve FairlightATH-M40X headphones
Focus: the complete sound design pass — taking a film from raw production audio to a finished, layered mix delivered at broadcast standard.
The mix hierarchy — establishing levels and prioritiesBefore a final mix, establish a level hierarchy: set dialogue to -12 dBFS average. Music under dialogue: approximately -24 to -28 dBFS average. Ambience: -30 to -35 dBFS average. Spot SFX: -18 to -12 dBFS at peak events. This hierarchy keeps the mix intelligible and prevents any layer from dominating inappropriately. Adjust from this baseline rather than setting levels intuitively without reference.
Spatial audio — stereo field and sense of placeThe stereo field creates a physical sense of space. Ambience should be spread across the full stereo field (±100%) — this creates the sensation of being inside an environment. SFX with a visual position should pan with that movement (a car passing left to right — pan hard left entry, crossing centre, hard right exit). Dialogue stays centred. These spatial choices work below conscious awareness but create physical believability.
Reverb and delay — creating spatial depthReverb simulates the acoustic signature of a space. Adding a short, small-room reverb to dialogue recorded very close to the subject (which sounds too dry and present) makes it feel more naturally placed in the scene. Key principle: all sounds in the same space should share the same reverb character — they should sound like they're in the same room. Sounds in different spaces (a phone call, a voice from another room) should have different reverb characteristics.
Stems delivery — M&E and dialogue separationA professional delivery often requires stems: separate audio exports of each major layer. Dialogue stem: just the dialogue tracks. Music stem: just the music. Effects stem: SFX and foley without dialogue. In Fairlight, route each layer to its own bus, then export each bus as a separate file alongside the full mix. Verify the stems combined reproduce the original mix exactly.
The final listening session — three system checkBefore any delivery: listen to the complete mix on three different systems. First: ATH-M40X (your primary monitor — catch technical issues). Second: built-in Mac speakers (reveals imbalances in the midrange). Third: earbuds or a small Bluetooth speaker at normal listening volume (reveals the mix's emotional impact in casual context). Address issues found across all three before export.
Drill 1
Build the complete mix hierarchy
Take a 2-minute piece with dialogue, ambience, SFX, foley, and music all on separate tracks. Without listening, set initial levels according to the hierarchy in the theory: dialogue at -12 dBFS, music at -24 to -28, ambience at -30 to -35, SFX at -18 peak. Now listen. How close did the non-listening hierarchy get you to a workable starting balance?
Drill 2
Stereo field placement
Take a scene with at least 3 SFX events with clear spatial positions. Pan each SFX event to match its visual position. Add movement for anything that crosses the frame. Listen on ATH-M40X with your eyes closed — can you reconstruct the visual geography of the scene from audio alone? A well-panned mix should make this possible.
Drill 3
Reverb matching exercise
Take a scene where dialogue was recorded very close to the mic (very dry, close presence). Find a Fairlight reverb preset that matches the acoustic character of the filming location. Apply the same reverb (at low wet mix, around 15–20%) to the dialogue, the foley footsteps, and the spot SFX. All three elements should now feel like they exist in the same physical space.
Drill 4
Stems export
Take your complete mix and export three stems: Dialogue, Music, and SFX (including foley and ambience). In Fairlight, create three additional buses, route the appropriate tracks to each bus, and export each bus as a separate stereo file. Import all three back into a new Resolve timeline and combine them — verify that the three stems combined reproduce the original mix exactly.
Week 4 Assignment
"Complete sound design pass"
Take your Phase 1 short film and produce a complete, professional sound design pass: full foley, complete ambience layers, all SFX spotted and placed, music edited and mixed, everything at -14 LUFS / -1 TP. Deliver: the finished film with full mix, three stems (Dialogue, Music, SFX), and a written sound design document describing every significant audio decision.
All five audio layers are present at appropriate levels
Music is correctly mixed and ducked under dialogue
Stems combine back to the full mix exactly
Loudness: -14 LUFS / -1 TP
Rode VideoMic NTGDJI Mic 2ATH-M40X headphonesDaVinci Resolve Fairlight
Mixing only on headphones without checking on speakers
Closed-back headphones produce an artificially large, separated stereo image that doesn't represent how a mix sounds on speakers. A mix built entirely on headphones often has too much stereo width, too much bass, and balance decisions that fall apart on laptop speakers — which is how most viewers hear your content.
Fix: Complete your primary mix on the ATH-M40X but always check on at least two other systems before delivering. Mac built-in speakers reveal midrange imbalances. A small Bluetooth speaker at low volume reveals the mix's emotional impact in casual listening conditions.
Music that drowns dialogue
Music that feels appropriately present when no one is speaking becomes overwhelming under dialogue — the two compete for the same frequency range.
Fix: When dialogue begins, music should duck to approximately -24 to -28 dBFS. Use automation to control this transition rather than setting a single compromise level. The drop should be inaudible because the dialogue takes the listener's attention.
Missing room tone — dead air between dialogue edits
Every dialogue cut not covered by room tone has a brief moment of completely dead audio. These 'silences' are immediately noticeable even if the viewer can't name what they heard.
Fix: Always record room tone at every location (60 seconds minimum). Place a loop of room tone on a dedicated track under all dialogue throughout the scene, set to a subliminal level but continuous.
Use Fairlight's built-in Sound Library before buying external libraries
Resolve's Fairlight Sound Library (View → Show Sound Library) contains thousands of categorised SFX. For most Phase 1–3 projects it provides everything you need. External libraries become valuable when you need very specific sounds for specialist work.
DaVinci Resolve Fairlight
The DJI Mic 2 internal recording as a sound design source
The DJI Mic 2 transmitter records internally as a backup — but the internal recording also captures ambient sound very differently from the main signal chain. It occasionally captures environmental sounds or room character that wasn't recorded cleanly on the main channel. Import the internal recording of every session and listen through it for ambient sound design material you may have captured accidentally.
DJI Mic 2
Score your own films — even if you're not a musician
Apps like GarageBand (free on Mac), Logic Pro, and Splice make it possible to build effective underscore without traditional musical training. Start with a single sustained pad or drone at a low level under a scene and listen to how it changes the emotional register. Add elements one at a time. The simplest possible score — one or two instruments — is often more effective than a dense orchestral arrangement.