Automated & Virtual Product Placement (VPP)

The advertising ecosystem has spent years pursuing an apparently contradictory objective: increasing content monetization without compromising the user experience. Across linear TV, OTT, AVOD, and FAST services, growth in advertising inventory has traditionally been achieved by adding more ad breaks or increasing ad load per hour. However, this approach has clear limitations in an environment where audience attention is becoming an increasingly scarce resource.

Against this backdrop, technologies such as Automated Product Placement (APP) and Virtual Product Placement (VPP) are generating growing interest. While industry terminology has yet to be fully standardized, both concepts are commonly used to describe the digital insertion, modification, or replacement of products, brands, and promotional assets within audiovisual content after production has been completed.

The underlying technology is not new. Early virtual insertion systems first emerged decades ago in sports broadcasting, where physical signage could be replaced with virtual advertising tailored to different geographic markets. What has changed dramatically is the level of automation, scalability, and contextual intelligence made possible by recent advances in artificial intelligence and computer vision.

However, the real value of this technology goes beyond its technical sophistication. For broadcasters, OTT platforms, and FAST operators, its transformative potential lies in something far more significant: the ability to turn traditionally static elements within content into a new source of addressable advertising inventory.

From stage prop to advertising asset

Traditional product placement has been part of the audiovisual industry for decades. Its operating model is relatively straightforward: a brand negotiates with the production company to secure the presence of a product or logo during the filming of a movie, series, or television program.

However, this model comes with significant operational limitations. Integrations are permanent, difficult to modify, and typically tied to commercial agreements finalized before production begins.

Virtual Product Placement fundamentally changes this paradigm.

Using advanced video analysis techniques, platforms can identify surfaces, objects, and spaces within already-produced content that can accommodate advertising elements. Billboards, storefronts, digital screens, product packaging, vehicles, and even decorative objects can be transformed into virtual advertising placements without physically altering the original content.

From a monetization perspective, the shift is substantial. Elements that were once static components of the narrative become commercial assets that can be managed and monetized dynamically.

Technology Architecture: How It Actually Works

While each vendor relies on proprietary technology, the underlying architecture typically consists of four main layers.

Semantic Scene Understanding

The first step is to analyze the audiovisual content in order to identify objects, surfaces, and spaces suitable for advertising insertion.

Computer vision models generate semantic maps capable of distinguishing actors, furniture, vehicles, backgrounds, and other elements that could potentially serve as advertising surfaces.

The output of this stage is not yet an inserted creative, but rather a technical inventory of placement opportunities. Each opportunity is described through a rich set of metadata, including start and end timecodes, surface polygon coordinates within the frame, visibility percentage, temporal stability, relative size, estimated depth, orientation, blur level, exposure, contrast, potential occlusions, and editorial constraints.

This represents the first major difference compared to a traditional overlay. With an overlay, the ad is simply rendered on top of the video. In Virtual Product Placement, the advertising asset is integrated into a pre-calculated visual geometry within the scene itself.

Spatial Reconstruction, Camera Tracking, and Scene Mapping

Once an insertable space has been identified, the system must solve a far more complex challenge: understanding how that space moves throughout the shot.

To achieve this, depth estimation, camera tracking, and spatial reconstruction algorithms are used to calculate perspective, distance, and motion. In practical terms, the pipeline creates a sort of "parallel technical scene" alongside the original video.

The video itself remains two-dimensional, but the system builds a spatial interpretation of the scene: where the background is located, where objects exist in three-dimensional space, which surfaces are planar, which elements are closer to the camera, and how everything moves over time.

If the camera performs a dolly movement, pan, or zoom, the inserted creative must respond accordingly, maintaining consistent scale, orientation, and relative positioning throughout the sequence.

Visual Integration and Relighting

Once the creative has been placed correctly and all occlusions are respected, the most important challenge still remains: making the inserted element appear as though it was captured by the same camera, under the same lighting conditions, and within the same environment.

From a technical standpoint, this involves adapting the inserted asset to the lighting and photographic characteristics of the original shot.

The process typically begins by working within the appropriate color space. The final video may be delivered in Rec.709, HDR, Log, P3, or another format, while also containing compression artifacts, gamma curves, LUTs, or color grading adjustments.

To achieve a convincing integration, the system must interpret the source image and adapt the inserted asset to the same luminance, contrast, and color behavior.

The most advanced platforms analyze lighting, shadows, reflections, depth of field, and image texture in order to dynamically adapt the creative to the existing visual environment.

They also manage occlusion handling, allowing actors or objects to pass in front of the placement without breaking the visual illusion.

Rendering, Ad Activation, and Distribution

This is the layer where Virtual Product Placement begins to intersect with the AdTech ecosystem.

Depending on the provider, creatives may be integrated during post-production workflows or served dynamically during content distribution.

In a mature deployment, every creative should be accompanied by a comprehensive metadata layer, including product category, approved territories, campaign dates, excluded competitor brands, contextual restrictions, language versions, minimum size requirements, readability thresholds, and approval rules.

Once tracking, occlusion handling, perspective, and visual integration have been resolved, the system generates the final version of the shot.

Technically, this can be achieved in three ways. The first is baked-in rendering, where a new master is created with the brand permanently integrated into the content. The second is content versioning, where multiple variants of the same asset are generated for different markets, campaigns, or advertisers. The third is a more dynamic model, where the decision regarding which creative to insert is made closer to playback time.

In OTT and CTV environments, the most practical approach is typically a hybrid one. Content is analyzed in advance, insertable spaces are transformed into metadata-driven placement slots, and creatives are rendered into controlled content versions before distribution.

This approach helps avoid challenges related to latency, quality assurance, CDN caching, DRM workflows, and consistency across the various renditions of the ABR ladder delivered through HLS or DASH.

For readers interested in seeing the technology in action, this video provides a useful overview of how Virtual Product Placement works:

Market Evolution: From Virtual Sports Advertising to Generative AI

The history of Virtual Product Placement reflects a familiar pattern in the technology industry: many of the ideas gaining momentum today are not entirely new. In many cases, they emerged decades ago, but the technical, operational, and commercial constraints of the time prevented them from achieving large-scale adoption.

First Generation: Arriving Too Early

The origins of the sector can be traced back to pioneering companies such as Princeton Video Image (PVI), widely regarded as one of the founders of modern virtual advertising insertion. As early as the 1990s, PVI demonstrated that it was technically possible to replace physical advertising with virtual advertising during live sports broadcasts.

However, the technology arrived before the market was ready. The required infrastructure demanded enormous processing power, dedicated hardware deployed inside stadiums, and highly complex real-time rendering and transmission systems. At the same time, the advertising industry had not yet developed many of the concepts that are now fundamental to digital media, including audience segmentation and addressable advertising.

Although PVI eventually disappeared as an independent company, much of its technological legacy survived through the acquisition of its assets and patents by ESPN in 2010.

Second Generation: Proving Commercial Viability

During the 2010s, companies such as Mirriad expanded the concept beyond sports and into films, television series, and premium entertainment content. Their key contribution was demonstrating that both advertisers and content owners had a genuine interest in integrating brands into content that had already completed production.

The challenge, however, remained scalability.

While the technology was capable of delivering highly convincing advertising integrations, much of the workflow still depended on human supervision and specialized post-production teams responsible for validating and refining each placement. As a result, campaigns remained relatively expensive and the volume of content that could be processed was inherently limited.

In recent years, Mirriad has evolved toward far more automated approaches built on artificial intelligence and deeper integration with the broader AdTech ecosystem.

Third Generation: Streaming, Addressability, and Generative AI

The current generation is represented by companies such as Ryff and Rembrand, each approaching the challenge from a different but complementary perspective.

Ryff's approach focuses on bringing advertising insertion closer to the point of consumption. Its vision is to decouple placements from the video master, allowing the same piece of content to display different brands depending on the viewer, market, or playback context. This approach aligns more closely with the principles of addressability and personalization that define modern OTT, AVOD, and FAST environments.

Rembrand, meanwhile, represents the arrival of generative AI within this category.

Where traditional solutions relied on inserting predefined graphic assets, newer AI-driven technologies are increasingly capable of understanding the visual context of a scene and generating advertising objects or brand integrations in a far more organic and automated manner. The goal is no longer simply to create believable placements, but to dramatically reduce operational costs while enabling thousands of hours of content to be processed at scale.

The convergence of artificial intelligence, automation, and digital distribution is accelerating the maturation of the sector. For the first time, the industry simultaneously possesses the technology, computing power, and distribution infrastructure required to transform Virtual Product Placement into a genuinely scalable advertising category.

Future Capabilities and Remaining Challenges

If the first phase of Virtual Product Placement was focused on proving technical feasibility, and the second on automating execution, the next stage will be defined by its integration into digital monetization ecosystems. However, for this evolution to scale, the industry will need to address significant challenges around measurement, standardization, and the commercialization of this new form of inventory.

Automated Trading

One of the most compelling hypotheses surrounding the sector is the potential convergence between Virtual Product Placement and programmatic buying infrastructure.

Today, most campaigns continue to be managed through direct sales or semi-automated workflows. The market's natural evolution, however, points toward models in which advertising opportunities identified within a scene can be traded through technology platforms similar to those currently used for digital video and Connected TV.

For these models to become viable, several fundamental questions must first be addressed. What constitutes a valid impression? How should placement visibility be measured? Which verification and auditing mechanisms can be applied?

The industry will also need to determine how this inventory fits into the broader advertising value chain and what roles broadcasters, content owners, technology platforms, and media buyers should play in its commercialization and delivery.

Continuous Catalog Monetization

For broadcasters, studios, and streaming platforms, one of the most significant opportunities lies in extending the commercial lifespan of existing content libraries.

Historically, the monetization of a film or television series has been largely determined by its distribution windows and the advertising formats associated with each of them. Virtual Product Placement introduces a new variable: the ability to refresh, update, and monetize selected visual elements years after the original production has been completed.

Viewed through this lens, content libraries cease to be static assets and instead become continuously monetizable environments. Campaigns, brands, and creative messaging can be adapted to new markets, audiences, and time periods without requiring any modification to the original work itself.

Contextual Intelligence and Interactivity

Another area with significant long-term potential lies at the intersection of contextual AI and interactive experiences.

As AI models become increasingly capable of understanding scenes, emotions, and narrative context, it becomes technically feasible to select creatives not only based on who is watching the content, but also on what is happening within the story itself.

At the same time, the continued adoption of Smart TVs and interactive viewing experiences is creating new opportunities for audience engagement. Virtually inserted elements could evolve into interactive touchpoints capable of triggering product discovery experiences, promotional offers, or even direct purchasing journeys.

In this scenario, Virtual Product Placement begins to move beyond traditional brand integration and closer to emerging concepts such as Shoppable TV.

The question is no longer whether the technology can integrate a brand into a scene in a believable way. That technical barrier appears to be rapidly disappearing. The real challenge is whether the industry can establish the standards, measurement frameworks, and attribution models needed to support this new category of inventory.

If that happens, Virtual Product Placement could emerge as a distinct advertising category within the broader audiovisual ecosystem. Its primary value would not be to replace existing formats, but to expand the pool of monetizable inventory by unlocking spaces that have historically existed solely as part of the content itself.

For the first time, visual elements traditionally regarded as part of the set design, production environment, or narrative backdrop could be managed as standalone advertising assets. The result would be new monetization opportunities that do not necessarily require increasing the advertising load perceived by the viewer.