There's a rule in information science that most retailers have never heard but will feel in their revenue within 36 months:
Unindexed content does not exist.
It doesn't matter how good it is. It doesn't matter how much you spent developing it. If a search system can't read it, it isn't there. This was the hard lesson of early SEO. The businesses that described their web pages in structured, crawlable terms got found. The businesses that buried information in Flash animations and image-only layouts became ghosts.
We are about to learn this lesson again. But the stakes are higher, the timeline is shorter, and the "search system" is no longer a person typing keywords into Google.
It's an AI agent making a purchase decision in milliseconds on behalf of a consumer who will never see your product page.
The real threat
What most mid-sized retailers believe the threat landscape looks like: Amazon on one side, Shein on the other, and margin compression in the middle. The strategic response is predictable. Cut costs. Optimize logistics. Maybe launch a loyalty program.
None of that addresses what is actually coming.
In agentic commerce, an AI agent acts on behalf of a shopper. The shopper says something like: "I need a dress for a rooftop wedding in June, I run warm, I want to look put together but not overdressed, and I don't want to spend more than $300." The agent doesn't browse. It queries. It pulls structured product data from every merchant in its index, scores each item against the shopper's constraints, and returns a curated set of options.
Now ask yourself: what does your product feed actually say about that item on page 14 of your catalog?
If the answer is "Black dress. Versatile. Perfect for any occasion" you just got filtered out. Not because your product was wrong. Because your description was too thin for the agent to evaluate. You were semantically invisible.
This is not a future scenario. Shopify's Model Context Protocol is live. OpenAI's agent APIs are in production. Google's shopping agents are indexing product feeds right now. The infrastructure for agent-mediated commerce is being built while most retailers are still debating whether to add a chatbot to their homepage.
The margin compression retailers fear isn't coming from a competitor with lower prices. It's coming from their own product feeds.
The retrieval problem
In 1965, information scientist Gerard Salton developed the Vector Space Model, one of the foundational concepts behind modern search. The core principle is simple: a document is only as findable as its representation in the index.

If you write a brilliant paper about "cardiac events in post-operative patients" but your metadata only says "heart stuff," no search system will surface you for the right query. The content is there. The retrieval path is broken.
Retail is sitting on the largest retrieval failure in commerce.
The industry spends billions on product development. But then it describes those products with four words and a stock photo. A $2,000 jacket might have "wool blend blazer / navy / classic fit" as its entire machine-readable identity.
Compare this to what a knowledgeable sales associate would say about the same jacket: "Structured but not stiff. Cool navy with a slight indigo undertone. Best paired with tapered trousers or dark denim, not chinos. Works for a creative office or a dinner reservation, not a formal board meeting. Runs slightly narrow in the shoulder for athletic builds."
That associate just generated metadata worth thousands of dollars in conversion lift. And none of it exists in any product feed.
The lemons problem
George Akerlof won the Nobel Prize for describing this exact dynamic. In his 1970 paper "The Market for Lemons," he showed that when buyers can't distinguish between high-quality and low-quality products due to poor information, the market collapses to the lowest common denominator. Sellers of quality goods exit, and only "lemons" remain.
Thin product feeds are creating a lemon market. Not because the products are bad, but because the descriptions are too thin for any system (human or machine) to tell the difference between good and great.
When an AI agent can't distinguish your $280 Italian-milled ponte dress from a $45 fast-fashion knockoff based on structured data alone, it will default to price. And you lose.
What the fix actually looks like
The fix is not a better chatbot. The fix is not a better recommendation carousel. The fix is treating product metadata as a strategic asset with the same rigor you'd apply to inventory management or pricing strategy.
1) Recognize that metadata is not a content task. It is an intelligence task.
Writing product descriptions and building semantic metadata are two completely different activities. A product description sells to a human. Metadata sells to a machine. The first is prose. The second is structured, attribute-rich, context-aware data that tells an agent exactly who this product is for, what it pairs with, what occasions it fits, and what constraints it satisfies. This is not copywriting. This is product intelligence.
2) Encode your domain knowledge before it walks out the door.
Every retailer has at least one person who can look at a customer and assemble the right solution in under 60 seconds. That person's knowledge is worth millions in conversion lift, return reduction, and repeat purchase rates. But it lives in their head, works 40 hours a week, and leaves when they do. The retailers that encode that expertise into structured, machine-readable formats aren't just improving their product feeds. They're preserving their most valuable and most perishable competitive advantage.
3) Measure the cost of semantic invisibility, not just the cost of technology.
When I talk with mid-sized retailers, the question is always "What does it cost to enrich our catalog?" The better question is "What is it costing us that our catalog is currently invisible to AI-driven discovery?"
If agents are already indexing your products and finding them too thin to recommend, you are losing revenue today. Not tomorrow. Today. And the gap compounds every month as agent adoption accelerates.
The bottom line
The mental model most retailers carry is: "AI is a tool we'll adopt when the time is right." The reality is that AI is a system that is already evaluating you, and your product data is the exam.
You don't get to decide when to show up. You only get to decide whether you're legible when the agent looks.
Logistics has been solved. Payments have been solved. Fulfillment has been solved. The last frontier of competitive advantage in commerce is not operational efficiency. It is semantic clarity. The ability to describe what you sell with enough precision that a machine can match it to a human's actual need.
Your margin is your metadata.
Hope you enjoyed the reading.
Álvaro T.
Sources
- Akerlof, G.A. (1970). "The Market for 'Lemons': Quality Uncertainty and the Market Mechanism." The Quarterly Journal of Economics, 84(3), 488-500.
- Salton, G., Wong, A., Yang, C.S. (1975). "A Vector Space Model for Automatic Indexing." Communications of the ACM, 18(11), 613-620.
- Nonaka, I. & Takeuchi, H. (1995). The Knowledge-Creating Company. Oxford University Press.
- Shopify (2025). "The Agentic Commerce Platform."

