Invention Title:

IN-CONTENT VOICE COMMERCE ENGINE

Publication number:

US20260080870

Publication date:
Section:

Physics

Class:

G10L15/22

Inventor:

Applicant:

Smart overview of the Invention

The system and method enable real-time, voice-activated commerce within audiovisual content, allowing viewers to purchase products displayed in programming through natural language commands. It integrates merchant-uploaded "digital twins" of products, content analysis via metadata or AI-powered recognition, and contextual interpretation of queries. The system supports multilanguage functionality and performs end-to-end commerce execution within television or streaming platforms. A monetization framework ensures only registered and verified products are presented, creating a controlled revenue model for brands and content originators. The system also extends to AR, VR, and mixed reality environments.

Background

Digital commerce has transformed consumer interactions, with purchasing decisions increasingly influenced by digital platforms and streaming media. Voice-enabled digital assistants have become common, offering convenience through AI and natural language processing. However, current digital commerce solutions are limited to web-based platforms and separate applications, lacking integration with audiovisual content. There is a growing need for systems that allow seamless commerce within audiovisual and immersive experiences, enhancing engagement and simplifying purchases.

Prior Art

Existing systems, such as those disclosed in U.S. Pat. No. 5,774,664 and U.S. Pat. No. 9,928,532, involve synchronizing TV signals with web content or identifying products through user-submitted images. These methods require users to leave the viewing environment or involve manual steps, resulting in fragmented commerce experiences. Other systems focus on digital shopping assistants and mobile apps but remain disconnected from audiovisual content and voice interaction. No prior art combines real-time content analysis, AI recognition, and natural language commands into a unified commerce engine.

Functionality

The invention integrates product metadata, audiovisual stream analysis, and voice processing into a seamless transaction flow. Merchants register products by uploading "digital twins," ensuring structured cataloging. The system processes audiovisual content through metadata or AI recognition, dynamically associating products with on-screen events. Voice commands trigger product identification, and the system provides actionable responses, including visual overlays and voice outputs. The monetization framework restricts product presentation to registered entries, supporting a controlled marketplace.

Implementation and Applications

The invention completes transactions natively within the media environment, using secure, pre-linked payment accounts. It supports various payment models, including loyalty points and subscriptions. The platform-agnostic design allows implementation across smart TVs, streaming devices, and cloud platforms. Extending to XR environments, the system enables voice-driven product discovery in spatial content. Personalization features tailor responses based on user profiles, while localized availability adjusts options by region. Analytics and reporting tools provide insights into viewer interactions, ensuring data-driven optimization.