AI WORKFLOWS · April 28, 2026

How We Generate B-Roll with Higgsfield Cinema

We stopped licensing stock footage last year. The replacement isn't cheaper stock — it's Higgsfield Cinema, and the workflow took about three weeks to dial in.

The problem with stock B-roll isn't cost; it's specificity. You need a shot of someone loading shipping containers at a Colombo port at dusk, and the library has Seattle in 2014. You license it anyway. The client notices. The spot looks like it could belong to anyone.

Higgsfield solves this at the prompt level. The model understands camera language — dolly moves, rack focus, lens flares from specific focal lengths. You're not describing a scene; you're calling a shot.

The Prompt Framework We Use

After several hundred generations, we settled on a three-part prompt structure.

Subject + action + environment. Don't lead with mood or aesthetic. Ground the model in something concrete first: "A cargo ship at dusk entering a port" performs better than "dramatic maritime sunset." The cinematic quality comes from the second layer.

Camera directive. Higgsfield responds well to specific camera language: "slow dolly push," "handheld follow," "locked-off wide." Be literal. "Cinematic" as a descriptor is nearly worthless.

Texture qualifier. This is where the look gets locked. We reference physical cameras: "shot on ARRI Alexa," "anamorphic bokeh," "35mm grain." These steer the model away from the plastic AI look into something that holds up next to live footage.

A prompt that works in production:

Cargo ship entering port at dusk, slow dolly push from dock level,
shot on ARRI Alexa Mini LF with 40mm anamorphic, warm sodium light
on steel hull, light haze, birds in background, no people
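
To keep the three layers in a fixed order across a batch of generations, we template them. A minimal sketch in Python; the class and field names are ours for illustration, not anything Higgsfield exposes:

from dataclasses import dataclass

@dataclass
class ShotPrompt:
    subject: str          # concrete subject + action + environment
    camera: str           # literal camera directive, not "cinematic"
    texture: str          # physical-camera texture qualifier
    negatives: str = ""   # optional exclusions, e.g. "no people"

    def render(self) -> str:
        # Assemble layers in a stable order so one layer can be
        # A/B-tested at a time without disturbing the others.
        parts = [self.subject, self.camera, self.texture]
        if self.negatives:
            parts.append(self.negatives)
        return ", ".join(parts)

prompt = ShotPrompt(
    subject="Cargo ship entering port at dusk",
    camera="slow dolly push from dock level",
    texture="shot on ARRI Alexa Mini LF with 40mm anamorphic, 35mm grain",
    negatives="no people",
)
print(prompt.render())

The point isn't the class; it's that the fixed ordering makes it cheap to vary one layer while holding the other two constant.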

Matching to Live Footage

The hardest part isn't generation; it's integration. AI footage and live footage differ in noise pattern, colour primaries, and temporal stability. We run a two-step process in DaVinci Resolve.

First, colour-match using the AI shot as the source and the live footage as the target. This gets the overall look close. Then we add film grain manually — Resolve's built-in grain node set to match the noise pattern of our camera package.
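
The match itself happens in Resolve's UI, but both steps reduce to standard image operations. A rough numpy sketch of the idea, assuming frames as float RGB arrays in [0, 1]; the function names are ours, and Gaussian grain is only a first approximation of what the grain node actually does:

import numpy as np

def match_color(ai: np.ndarray, live: np.ndarray) -> np.ndarray:
    # Per-channel mean/std transfer: pull the AI shot's colour
    # statistics toward the live reference (Reinhard-style).
    ai_mu, ai_sd = ai.mean(axis=(0, 1)), ai.std(axis=(0, 1))
    lv_mu, lv_sd = live.mean(axis=(0, 1)), live.std(axis=(0, 1))
    out = (ai - ai_mu) / (ai_sd + 1e-8) * lv_sd + lv_mu
    return np.clip(out, 0.0, 1.0)

def noise_sigma(frame: np.ndarray) -> float:
    # Crude noise estimate: high-pass via a one-pixel vertical
    # difference, then a robust spread (MAD scaled to sigma).
    residual = frame - np.roll(frame, 1, axis=0)
    mad = np.median(np.abs(residual - np.median(residual)))
    return 1.4826 * mad

def match_grain(ai: np.ndarray, live: np.ndarray, seed: int = 7) -> np.ndarray:
    # Add just enough grain to raise the AI shot's noise floor to
    # the live footage's; never try to subtract noise.
    deficit = max(noise_sigma(live) - noise_sigma(ai), 0.0)
    rng = np.random.default_rng(seed)
    return np.clip(ai + rng.normal(0.0, deficit, ai.shape), 0.0, 1.0)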

The result: in three commercial edits this year, not one client identified the synthetic shots in review. That's the benchmark.

What It Doesn't Do Well

Higgsfield still struggles with faces. Any close-up with a human subject drifts into uncanny territory within a few seconds. We don't use it for talent-focused work — that's still a camera job.

It also has trouble with fast cuts. The model generates clips of 3–6 seconds, and motion continuity breaks down if you try to match-cut between two AI-generated shots. We use AI B-roll as cutaways and inserts, not as sequences.

The Economics

A three-day licensed stock package for a mid-sized commercial runs $800–2,000. Higgsfield is a monthly subscription. On a per-project basis, we're at roughly 15% of the previous stock budget with footage that's actually specific to the brief.

The trade-off is time. Generation, review, and integration add half a day to post on a typical project. At our billing rate, it breaks even around the $500 stock budget mark. Below that, we still occasionally license.
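
The break-even is simple enough to sanity-check. The hourly rate below is a hypothetical placeholder chosen to reproduce the ~$500 figure; the only inputs from this post are the half day of added post time and the break-even itself:

HOURLY_RATE = 125   # assumed blended post rate, $/hr (hypothetical)
ADDED_HOURS = 4     # "half a day" of generation, review, integration

added_cost = HOURLY_RATE * ADDED_HOURS   # ~$500 of extra post per project
for stock_budget in (300, 500, 800, 2000):
    choice = "license stock" if stock_budget < added_cost else "generate"
    print(f"${stock_budget:>5} stock budget -> {choice}")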