When AI Starts Shopping for Us

A shopper does not type “treadmill” anymore.

They ask for a treadmill that is good for indoor marathon training, easy on the knees, and not too expensive.

That small shift changes the whole commerce model. The consumer is no longer moving through a store, page by page, filter by filter, until they find a product. The consumer is bringing intent, constraints, context, and comparison logic into one AI-led interaction.

Agentic commerce is commerce where AI agents help consumers move from intent to action, including discovery, comparison, recommendation, checkout, and post-purchase support.

From discovery to decision – the new shopping baseline

Google’s “From Discovery to Delivery” demo is useful because it does not start with a store. It starts with a runner’s need.

The runner uses AI Mode in Search, Gemini, Lens, YouTube, and product evaluation to move from training intent to shopping decision. The important point is not that Google has another AI demo. The important point is that the old journey has been compressed.

The mechanism is simple: the agent holds the shopper’s intent, constraints, product evidence, comparison logic, and action handoff in one flow.

For enterprise teams, this is where content, product data, consent, analytics, service, and checkout stop being separate workstreams.

Because the agent can connect the shopper’s question to product evidence, policy confidence, and purchase action, the experience feels less like browsing and more like being guided to a decision.

This creates the super-empowered consumer.

The super-empowered consumer is a shopper who uses a personal AI layer to research, compare, interpret, and act faster than any single brand interface.

The stance is clear: agentic commerce is not another campaign surface, it is becoming the default operating layer for consumer experience.

UCP – the rails for agentic commerce

Universal Commerce Protocol is an open standard that lets AI surfaces, merchants, and payment providers work together across commerce actions such as discovery, cart building, checkout, and order management.

This is where the story gets much bigger than one Google feature.

UCP is the moment agentic commerce starts looking less like demo theater and more like commerce infrastructure. The public UCP ecosystem shows names such as Google, Shopify, Etsy, Wayfair, Target, Walmart, Amazon, Microsoft, Meta, Salesforce, and Stripe. Google also says UCP was co-developed with Shopify, Etsy, Wayfair, Target, and Walmart, and endorsed by more than 20 others across the ecosystem including Adyen, American Express, Best Buy, Flipkart, Macy’s, Mastercard, Stripe, The Home Depot, Visa, and Zalando.

That partner list matters because commerce standards only matter when the ecosystem starts treating them as practical rails. One retailer experimenting with an AI assistant is interesting. Google, Shopify, Walmart, Amazon, Microsoft, Meta, Salesforce, Stripe, Visa, Mastercard, and Macy’s appearing around the same agentic commerce direction is a different signal.

The first visible pattern is already clear: Shopify shows how agents can search, cart, and check out through commerce infrastructure, while Macy’s shows how product discovery can become guided decision support inside a large retail catalog.

The real question is not whether shoppers will use agents, but whether brands have made their catalog, content, policies, inventory, identity, consent, and checkout reliable enough for agents to act on.

Shopify – commerce becomes agent-ready

Shopify makes agentic commerce feel real because it moves beyond product recommendations.

It gives agents three things they need to shop properly: a catalog, a cart, and checkout.

The Shopify Catalog lets agents search hundreds of millions of products with real-time inventory and localized pricing. Universal Cart lets a shopper hold products from multiple stores in one place. Checkout Kit loads the merchant’s checkout while keeping the experience native to the AI agent.

That is the shift.

The AI agent is no longer just answering, “Which product should I buy?” It can help find the product, compare it, hold it, and move the shopper toward purchase.

That changes what ecommerce operations means. Product titles, descriptions, categories, attributes, FAQs, return policies, structured data, stock, pricing, and checkout eligibility are no longer hygiene fields buried below the marketing layer. They become the evidence an agent uses to decide whether a product deserves to appear in the conversation.

This is the uncomfortable part. AI shopping does not reward the most beautiful homepage. It rewards the clearest machine-readable decision system.

If the category is weak, the product is harder for the agent to place. If the product description is thin, the agent has less to trust. If policy content is vague, the shopper’s risk question cannot be answered with confidence. If structured data is missing, the machine has to infer what the business should have made explicit.

For larger brands, Shopify is not the whole answer. It is the warning signal. If your website content, product data, images, ratings, customer service answers, consent rules, stock, pricing, and checkout logic do not work together, agents will see the gaps before consumers even reach your store.

Macy’s – when bad search becomes guided selling

Macy’s website shows the Ask Macy’s AI shopping assistant guiding product discovery beside the main shopping page.

Macy’s is the better enterprise example because the starting problem is painfully familiar.

Large catalog. Too many SKUs. Search terms that do not match how consumers actually ask. Results pages that push the shopper back into work instead of helping them decide.

Ask Macy’s changes that pattern. Macy’s and Google turned product discovery into a guided chat, built with Google’s Gemini Enterprise for Customer Experience. The launch was not framed as a novelty chatbot. It was framed as a way to make digital shopping feel more guided, more personal, and closer to the help a shopper might expect in store.

The numbers are what make it worth paying attention to. It has been reported that Macy’s had more than 2.5 million SKUs in its product catalog, launched the tool from a small share of users to half of site users within a day, expanded to 100% a week later, and saw early beta revenue per visit about 4.75x higher among Ask Macy’s users than non-users.

That 4.75x figure should not be copied into a business case as a universal benchmark. It is early beta data, and it may be influenced by user selection, placement, product mix, measurement method, and intent quality.

But the signal is still useful. The commercial value was not that AI answered a question. The value was that the shopper stayed inside a decision path.

That is the difference between AI as a feature and AI as an operating model. A feature answers. An operating model connects the answer to assortment, availability, margin, policy, service, measurement, and conversion.

Enterprise readiness – agents will find the mess

Agentic commerce will not politely ignore weak foundations. It will expose them.

If your product data says one thing on the website, another thing in retailer feeds, and something else in customer service answers, the agent has a trust problem.

If stock, pricing, images, ratings, policies, and checkout rules are not aligned, the agent cannot confidently guide the shopper. It has to guess, skip, or hand off too early.

That is where many brands will struggle. Their consumer experience looks connected on the front end, but behind the scenes the journey is split across brand, ecommerce, CRM, media, legal, analytics, service, and IT.

The consumer will not care which team owns the gap. They will only see the broken answer, the missing product, the wrong promise, or the failed handoff.

Measurement also has to change. Agentic commerce is not only about which channel drove the click. It is about where the decision formed, what evidence the agent trusted, and whether the guided path created a better commercial outcome.

This is why the work is not only technical. It is operating-model work.

Brands need clear ownership for the data, answers, policies, offers, consent rules, measurement, and exceptions that agents will use. Without that ownership, agentic commerce becomes another unmanaged touchpoint.

Agentic commerce readiness – get the data in order

Do not start with every agent, every platform, and every possible integration. Start with the foundation agents will depend on: clean product data, clear policies, accurate stock, usable content, structured attributes, consent rules, service answers, and checkout logic.

That foundation cannot sit inside one team. Agentic commerce cuts across brand, ecommerce, CRM, media, legal, analytics, service, and IT. If those teams do not align what the agent can know, say, recommend, and trigger, the consumer will see the gaps immediately.

Takeaway: choose one high-value decision path, list the shopper questions the agent must answer, verify the product data and policy answers behind those questions, align ownership across the teams involved, connect only the safe commerce actions, and measure whether the guided path improves confidence, conversion, or service effort versus today’s search and checkout flow.


A few fast answers before you act

What is agentic commerce?

Agentic commerce is commerce where AI agents help shoppers move from intent to action, including discovery, comparison, recommendation, checkout, and post-purchase support.

Why does agentic commerce matter now?

It matters because the consumer journey is moving from page navigation to AI-guided decision-making.

What is Google UCP?

Google’s Universal Commerce Protocol is an open standard that helps AI surfaces, merchants, and payment providers work together across commerce actions such as discovery, checkout, and order management.

Why is Shopify important in this shift?

Shopify shows that agentic commerce is not just about better recommendations. Agents need clean product data, real-time inventory, cart logic, checkout handoff, and merchant rules they can safely act on.

What does the Macy’s example prove?

It does not prove that every AI shopping assistant will deliver a 4.75x revenue-per-visit lift. It proves that guided discovery can keep shoppers inside the decision path when search results alone are not enough.

What should enterprise teams do first?

Start with one high-value decision path, get the product data, policies, ownership, consent rules, and checkout logic behind it in order, then measure whether AI-guided discovery improves confidence, conversion, or service effort.

Higgsfield: The Agency Model Challenged

A $47,000 agency quote and a 12-minute AI-generated campaign are not the same thing. But for brand teams, they now sit close enough to make every agency-dependent marketing model uncomfortable.

The useful signal in the Higgsfield Supercomputer demo below is not another AI video trick. It is not another Claude integration story either. It is that work once spread across strategy, creative, production, media, and measurement is now being pulled into one AI-driven workflow.

What makes the demo hard to ignore is not just the cost gap. It is the range of agency work now being challenged at once: strategy, positioning, creative production, ad variants, and distribution setup. That is the pressure point for the traditional agency model.

The setup: brand teams now have a new reference point

The numbers should be treated as a demonstration claim, not a procurement benchmark. The useful point is not whether $47,000 versus $18 is a fair universal comparison. The useful point is that brand teams now have a new reference point for speed, cost, and first-output expectations.

In the demo, the same type of work that would normally move through an agency process is shown as a one-chat workflow: brand book, launch video, ad variants, and campaign setup. That is why the comparison is hard to ignore. It does not prove that every agency output can be replaced. It does prove that the old cost-and-time story now has a serious challenger.

An AI media agent is a software workflow that can interpret a brief, select tools, generate or transform media assets, and return campaign-ready outputs with limited human handoff.

That changes the conversation. A retained agency, internal studio, or platform team can no longer defend every production timeline by pointing to complexity alone. Some complexity is real. Some of it is handoff debt, approval drag, tool fragmentation, and unclear operating ownership.

The mechanism: the brief becomes the production line

The real shift is not that the tool is simply better at making content. It is that more of the work stays together. In a traditional setup, the brief moves across several hands. Strategy interprets the signal. Creative turns it into an idea. Production turns it into assets. Media turns it into variants and tests. Every handoff adds time, cost, and a chance for the original insight to get diluted.

In an AI-driven workflow, one brief can do more of that work upfront. In the demo, the agent is described as reading 247 customer reviews, finding objections, shaping the positioning, creating the launch video, and preparing ad variants. That moves the work from a sequence of separate tasks into one connected workflow.

Because the agent keeps the brief, customer signal, creative options, and test logic together, the team can move faster from consumer insight to market test.

For enterprise teams, this matters because campaign speed is often blocked less by ideas and more by approvals, missing assets, market adaptation, and unclear ownership across the stack.

This does not make the agent the marketing department. It makes the agent a production layer. That layer still needs rules for claims, brand safety, usage rights, market language, measurement, asset ownership, publishing, and media activation.

Why it lands: the visible cost of delay

Why it lands is not because the output is guaranteed to beat agency craft. It lands because delay has become visible.

The real question is whether staying with the traditional path is worth the extra time, cost, and coordination risk.

The demo puts a simple operating question on the table. If a first version can be created quickly enough to test, then the expensive part is no longer the first asset itself. It is the decision work around it. What should be tested? What needs expert craft? What is good enough to learn from? What should wait until the evidence is stronger?

That is where the agency model gets pressured. Not because agencies suddenly have no value, but because production speed alone is no longer enough. Strategy, creative quality, governance, test design, and business learning have to justify the premium.

The stance here is clear: do not treat AI media agents as agency replacements; treat them as a new operating layer that forces every retained agency, internal studio, and platform team to justify its role against speed, quality, governance, and learning value.

Business intent: replace waste, not judgment

The wrong lesson is to use the agent to make more content. That only floods the system.

The better lesson is to use it to reduce the waste between signal and decision. One brief can help mine reviews, test positioning, create product shots, cut social variants, and prepare channel versions. The value is not the pile of outputs. The value is a faster read on what might work.

That is where the business case sits: fewer slow handoffs, cheaper first tests, and faster evidence for what deserves more investment.

Trend mapping belongs in the same logic. If an agent can read what is rising in social platforms and connect it to a brand, product, or category, distribution starts to behave less like a vendor handoff and more like a live operating system.

Before scaling, the workflow needs simple rules. Which claims are approved? Which brand boundaries cannot move? Which markets need language review? Which assets can be used? How are campaigns, variants, and results tracked? Where are files stored? Who signs off?

Without that, the team does not get transformation. It gets more assets to check, more exceptions to manage, and more noise in the system.

The operating test for AI media agents

Use this as a workflow test before you use it as a replacement story. Run the agent against one contained brief and compare the current process with the AI-assisted process on cycle time, revision load, quality threshold, approval effort, cost, and learning speed. The strongest test is not whether the agent makes a prettier video. It is whether the team can move faster from customer signal to creative option, from creative option to market test, and from market test to decision.

Takeaway: AI media agents should first be measured by how much they reduce handoff delay, testing cost, and decision ambiguity. The advantage comes when distribution stops being a vendor relationship and becomes a governed workflow.


A few fast answers before you act

Is Higgsfield replacing marketing agencies?

No. Higgsfield and similar tools pressure the agency model by compressing strategy-to-asset workflows, but enterprise teams still need accountability for brand, legal, media efficiency, measurement, and market learning.

What is the real enterprise use case?

The strongest enterprise use case is not more content. It is faster movement from customer signal to creative option, from creative option to market test, and from market test to decision.

Should teams use AI-generated ads directly?

Only after review. AI-generated ads should pass brand, claims, legal, consent, accessibility, and media-platform checks before they enter paid or owned channels.

Where does Claude matter in this example?

Claude matters as the orchestration surface. Through connectors such as MCP, a language model can call external media tools and turn a written brief into generated assets.

What should an agency now prove?

An agency should prove strategic judgment, distinctive craft, governance maturity, test design, and measurable business lift. Production speed alone is no longer enough.

What is the first practical pilot?

Start with one low-risk product or campaign need. Run the AI workflow against the current process and compare cycle time, cost, quality, revision effort, approval effort, and learning value.

Runway Characters: Real-time AI avatars

A real-time AI avatar is a video-based conversational agent that can listen, respond, and show synchronized facial movement during a live interaction.

Runway Characters is not just another image-to-video feature. It points to a bigger shift: interfaces that talk back, maintain expression, and sit inside websites, apps, support journeys and training environments as an interactive layer.

From chatbot box to embodied interface

For years, the consumer web has treated conversation as a text box. Runway Characters pushes the interaction into a more human-shaped format: a visual character with a voice, a defined personality, domain knowledge and live responsiveness.

The enterprise value is not the avatar; it is the controlled interaction layer around the avatar.

A controlled interaction layer is the set of rules, knowledge sources, permissions, actions, escalation paths and measurement signals that determine what the avatar can say and do.

This is why the product is more interesting for operators than for novelty-watchers. A branded face is easy to demo; turning it into a trusted, scalable and measurable service interface is the hard part.

The mechanism: image, voice, knowledge and action

The mechanism is straightforward: a single reference image defines the character, voice and personality shape the interaction, a knowledge base keeps the response inside a domain, and API actions allow the character to do work rather than just talk.

For enterprise teams, this turns the avatar from a creative asset into a governed service surface that sits between consumers, content, data and workflow.

A governed service surface is a customer-facing interface whose content, permissions, actions, analytics and escalation rules are deliberately controlled.

Because the avatar can combine expression, domain knowledge and actions in the same interaction, the experience can move from navigation to guided execution.

That is the commercial hinge. The avatar is not valuable because it smiles; it is valuable when it helps someone finish a task faster, with less confusion and fewer handoffs.

Where Runway Characters could create real utility

The obvious use cases are the ones Runway highlights: tutoring and education, customer support, training simulations, and interactive entertainment or gaming. Those are credible because the value depends on response, patience, expression and repetition.

The stronger enterprise use case is guided commerce and product selection. A character that understands a product range, asks clarifying questions, checks fit, explains trade-offs and hands off to the right next step could reduce decision friction in categories where consumers need guidance.

Brand and marketing experiences are another useful path, but only if they avoid becoming mascot theatre. A brand character should answer, guide, qualify, educate or convert; otherwise it is just a high-cost animation layer with weak business intent.

The real question is not whether the avatar looks impressive; it is whether the interaction reduces effort, shortens a service path, or improves a decision.

The operating model matters more than the character

The failure mode is predictable: teams launch a polished avatar before defining ownership, content governance, privacy boundaries, escalation logic and measurement. That creates a visible interface with unclear accountability.

For consumer experience platforms, the hard work sits behind the face. The avatar needs approved knowledge, consent-aware data access, clear action limits, analytics events, brand controls, QA scripts and a fallback path when confidence is low.

This also changes the content model. Product information, policy content, service scripts and training material need to be structured enough for a live character to use safely, not just published as static pages for humans to browse.

Runway Characters takeaway for enterprise teams

Runway Characters should be evaluated less like a creative tool and more like a new front-end pattern for service, learning, commerce and brand interaction. The adoption question is not “can we make a character?” but “which consumer or employee journey deserves a live conversational interface, and can we govern it?”

Takeaway: Treat real-time AI avatars as governed service surfaces, not animated brand assets. The winning teams will connect character design to knowledge governance, journey ownership, action permissions, measurement and fallback logic before scaling the experience.


A few fast answers before you act

What is Runway AI?

Runway is an AI company building generative media tools and world-simulation research systems. Runway describes its mission as building AI to simulate the world through the merging of art and science.

What is Runway Characters?

Runway Characters is Runway’s real-time avatar product for creating conversational video characters with customizable appearance, voice, personality, knowledge and actions.

Why does it matter for brands?

It matters because it can turn static content, support flows and training material into live guided interactions that feel more natural than a chatbot.

What are the best first use cases?

The best first use cases are narrow, repeatable journeys where guidance reduces effort: product advice, customer support triage, onboarding, training practice and education.

What is the main enterprise risk?

The main enterprise risk is launching a convincing avatar without clear governance over what it knows, what it can say, what it can do and when it must escalate.

How should teams measure success?

Teams should measure task completion, deflection quality, conversion support, time saved, escalation rate, user satisfaction and the cost of maintaining the knowledge base.