The AI Commerce Tracking Gap: How to Know When an Agent Recommended You (When GA4 Doesn't Tell You)
By Steve Merrill | April 26, 2026
Adobe Analytics reported AI-referred traffic to Shopify stores is up 393% so far this year. Most store owners look at their GA4 dashboard and see maybe a third of that.
The rest is gone. Buried in direct traffic, swallowed by "other," or just not recorded. This gap matters a lot right now.
If you're building out your AI visibility strategy, fixing product feeds, writing llms.txt files, getting structured data right, and measuring results with a broken ruler, you're making bad decisions. You're undervaluing the channel, not investing enough, or thinking it's not working when it actually is.
Why Your Analytics Can't See AI Traffic
Three things are happening at once.
First, most AI platforms strip the referrer header when they send users to external sites. A ChatGPT user clicks a product recommendation, lands on your store, and GA4 sees a direct session. No source. No medium. Just "direct/none."
Second, some platforms route clicks through redirects. The referring domain is an internal link, not the AI platform's domain. GA4 has no way to map that back to its origin.
Third, Shopify's default GA4 setup fires client-side. Users with privacy extensions, certain mobile browsers, or aggressive ad blockers can shut that tracking script off entirely. Server-side data collection doesn't have this problem. Most stores haven't set it up.
The practical result: your real AI traffic is probably 2-3x what shows in your standard reporting. For a store doing $500K/year and actively working on AI visibility, that gap means you're flying blind on ROI.
According to Adobe Analytics research on AI shopping trends, AI-originated traffic converts at significantly higher rates than average site traffic. Getting these numbers right matters more when the channel converts well.
How to Actually Track It
Step 1: UTM tags on everything you control
Any link pointing to your store that lives in content you control, your llms.txt file, your About page, any published content referencing your products, needs UTM parameters.
The structure is simple: ?utm_source=perplexity&utm_medium=ai-referral. Replicate this for each AI platform you want to track. When AI platforms crawl and resurface this content, some of those UTMs carry through to your analytics.
It's not a complete fix. It costs nothing and catches clicks that would otherwise disappear.
Step 2: A GA4 segment for known AI domains
Create a custom segment filtering sessions where the source matches any of these: chatgpt.com, perplexity.ai, gemini.google.com, bing.com/chat, copilot.microsoft.com, claude.ai.
Save it. Track it monthly. You're looking for trend direction here, not precise session counts. The trend tells the story.
Step 3: Watch your direct traffic closely
When an AI platform picks up your content, mentions your product, or recommends your store, you typically see a direct traffic spike 24-48 hours later. That's the stripped referrer showing up in your data.
Start keeping a log. Date every known AI exposure event, a product mention, a citation in Perplexity, a brand name appearing in ChatGPT Shopping, then check if your direct traffic moved shortly after. The pattern is surprisingly consistent once you start watching for it.
Step 4: Use Shopify's AI Channels report
Shopify added an AI Channels report under Analytics > Acquisition. It uses server-side data that captures sessions GA4's client-side tracking misses entirely.
Pull both numbers side by side: Shopify's AI Channels total and your GA4 AI segment total. The difference between them is roughly what's getting lost in your standard reporting. That's your true data gap.
What to Use for Measuring ROI
Stop relying on any single number. Use a composite.
Make Shopify's AI Channels report your primary metric. Add your GA4 AI segment as a secondary signal. Watch branded search volume as a leading indicator. AI recommendations tend to drive branded searches within 48-72 hours. Run manual tests in ChatGPT and Perplexity regularly to confirm your store is actually appearing.
That combination gives a much cleaner picture than GA4 alone. None of these are perfect. Combined, they're close enough to make real decisions.
According to Google's GA4 event tracking documentation, server-side tracking is the recommended approach for accurate data collection. Most Shopify stores haven't done it because the default setup is client-side.
One Quick Fix Worth 30 Minutes
Add structured metadata to your site header and llms.txt that explicitly identifies your store and links to your product pages. When AI platforms crawl your site, this helps them understand your store's identity and what you sell.
It's a marginal win on the data gap. It's also a direct improvement in how AI agents represent your store in their responses, so it's worth doing regardless. If you haven't written a proper llms.txt file yet, that's the place to start.
Frequently Asked Questions
Why does GA4 miss AI referral traffic from ChatGPT?
ChatGPT strips referrer headers when linking out. GA4 sees the session as direct traffic, not crediting it to ChatGPT. Some platforms also route clicks through redirects that further hide the origin.
How large is the tracking gap for most Shopify stores?
40-60% of AI-originated traffic ends up mislabeled. Most stores are seeing 2-3x their reported AI traffic in actual visits compared to what shows in GA4.
Which AI platforms send the most trackable referral data?
Perplexity is the most consistent. Google AI Mode and Gemini pass partial referrer data. ChatGPT and Copilot are the worst, especially for clicks inside a product card or chat session.
How do I know if AI is recommending my store if I can't track it?
Test it yourself. Search ChatGPT, Perplexity, and Gemini for products you sell and see if your store shows up. Then watch branded search volume. If you're getting recommended, branded searches typically spike within 48-72 hours.

