Where LLMs Actually Find Your Brand When People Stop Searching Google

TL;DR: When a reader asks ChatGPT, Perplexity, or Google's AI Mode "what should I use for X", the answer doesn't come from Google's top-10. It comes from a different surface set: Reddit, YouTube, podcasts, Hacker News, Stack Overflow, topical forums. The model saw those at training time and re-cites them at inference time. If your brand isn't present there, the model can't surface you, and ranking number one on Google won't change that. The fastest way to see the gap is a one-evening audit: pick three queries you'd want to surface for, ask three LLM panels the same prompt, log which competitors get cited and from which sources. The investment priority isn't "do everything everywhere." It's the one surface where the audit shows the steepest gap between competitor presence and yours.

The question I keep getting

Two operators asked me the same thing in the same week. The first runs a B2B SaaS, ranks number one on Google for their primary keyword, pulls steady organic traffic. The second runs a consumer brand in a crowded category, ranks third or fourth, and is losing share to a competitor whose Google rankings are objectively worse. Both asked a variation of: "I'm doing fine on Google, but ChatGPT keeps recommending the competitor. What gives?"

I've spent the past six months poking at this for clients and for vadimkravcenko.com, and the short answer is that Google rankings and LLM citations read from different graphs. Google's graph is links plus signals around the URL itself. The LLM graph is built from a different set of sources: places where humans talk about brands without the brand owner controlling the conversation. Reddit threads, YouTube videos, podcast episodes, Hacker News submissions, Stack Overflow answers. (Side note: I didn't fully believe this until I ran the same three queries on Perplexity and ChatGPT side by side for that #1-on-Google B2B client and watched both engines name the competitor and skip them entirely. That was the week I started taking it seriously instead of treating it as a curiosity.) The model trained on those sources, and at inference time it still leans on them. With no presence in the citation graph, your Google ranking is mostly invisible to the LLM.

This piece is the map and the audit. The map names the five off-site surfaces that carry signal and which weighs heaviest by query type; the audit is the one-evening method I use to find where the gap sits for a specific brand. Then: if it returns X, invest in Y first, not all five at once.

The citation graph isn't Google's link graph

Rand Fishkin made the point most directly in a SparkToro piece on appearing in AI answers, calling out the two surfaces that "seem to still hold particular sway with AI answers":

Reddit and YouTube (which seem to still hold particular sway with AI answers).

Fishkin's read matches what I keep finding on client audits, with two surfaces to add: podcasts and developer forums. The reason is mechanical. The major LLMs train on Common Crawl plus licensing deals (OpenAI's Reddit deal, Google's Reddit deal, both 2024) plus content scraped directly: YouTube transcripts, podcast transcripts via Apple, Spotify, and Listennotes, Stack Overflow's content. At inference time, parametric memory (what the model learned in training) works alongside a retrieval layer (a live search index over a similar set of sources). Both lean on the same surface set, and both lean less on the open web than Google does, because the open web is noisier and citation-grade sources are higher signal per token.

The notable absence in this graph is X. Pre-2023 tweets are in Common Crawl; post-2023 X content is firewalled, with no Common Crawl access, no OpenAI deal, no Google deal as of this writing. X-native brand mentions carry low surfacing weight in current LLM citations, so operators putting effort into X for LLM discovery are spending against a closed door. (I should note: I re-check this every couple of months. Nothing has moved as of May 2026, but it's the single item on this map most likely to flip, so don't treat my "closed door" as permanent.)

What each surface actually carries

Five surfaces carry most of the weight in 2026, and the relative weight shifts by query type.

Map of five off-site surfaces — Reddit, YouTube, podcasts, Hacker News, Stack Overflow — with rough weight bars by query category covering SaaS, B2B, consumer, and local intent — The five off-site surfaces by query category. Bars are rough observation, not survey data.

Reddit carries the heaviest weight for "best X for Y" and "is Z any good" queries across most categories; the OpenAI and Google licensing deals (both 2024) explicitly include it. Subreddit relevance matters: r/SaaS, r/Entrepreneur, and r/marketing for B2B; r/BuyItForLife and product-category subs for consumer. A mention in a small but topically relevant subreddit weighs more than one in a high-traffic but irrelevant one.

YouTube transcripts are scraped and indexed, cited heavily for "how do I X" and "X vs Y" queries. The cited unit is the specific video transcript, not the channel, so a single comparison video can drive citations across dozens of related queries for a year after publishing.

Podcast episode transcripts feed both training data and retrieval indexes, cited disproportionately for B2B expertise queries like "who's an expert on X". The signal is recurring third-party validation: a founder who's been a guest on twelve category-relevant podcasts over two years looks completely different to the model than one who hasn't.

Hacker News is a high-signal source for technical and SaaS topics; both the submitted URL and the comments are indexed. The signal is conversational density, not the URL itself. A submission with fifty thoughtful comments weighs more than one with a hundred upvotes and no discussion.

Stack Overflow remains the technical-question authority, cited for "how to fix X" code queries. Its weight has softened post-2024 as its own traffic declined under AI-answer pressure, but for code-specific surfaces it still holds. Topical forums (industry-specific communities, indie message boards, Discourse instances) act as a sixth catch-all. (For what it's worth: this is where I'm least confident in the whole map. The signal varies so wildly by community that any generalization I make should be stress-tested against your specific category before you act on it.)

Surface	What it carries	Query types where it dominates
Reddit	Recommendation-style discussion + reviews	"best X", "is Y any good", "X alternatives"
YouTube	Workflow walkthroughs + comparisons	"how do I X", "X vs Y", tool demos
Podcasts	Long-form expertise signal	"who's an expert on X", B2B services framing
Hacker News	Technical + SaaS conversation density	Developer-tool + infra-tool queries
Stack Overflow	Code-specific authoritative answers	"how to fix X" code questions
Topical forums	Domain-specific recurring presence	Niche-industry queries

How the signal actually moves from mention to citation

The mechanism matters because the lag matters. A Reddit mention posted today doesn't surface in a ChatGPT answer tomorrow. The signal moves through two paths with different cadences.

Two parallel paths from mention to citation: training-data ingestion (months) and retrieval indexing (days to weeks).

Path one is training data. A mention gets scraped or licensed, lands in a training corpus, and shows up in parametric memory. The lag runs in months, usually six to eighteen (my read from watching client mentions appear over time, not a published figure, so treat it as directional). The signal is durable once it's in (a 2023 mention still influences a 2026 inference) but slow, and weighted toward repeated mention rather than single events.

Path two is retrieval. The major LLMs run a live search-index layer (Perplexity makes this most explicit; ChatGPT's browse mode leans on it; Google AI Mode obviously does). It indexes fresh content faster, days to weeks, and is queried alongside parametric memory at inference time. A recent mention can surface within a few weeks this way, but the citation tends to be more URL-specific and less durable. The practical implication across both paths: cadence beats any one mention. A brand in four Reddit threads, three YouTube videos, and two podcast episodes over a year looks different than one mentioned once, and the audit reads that cumulative picture.

The one-evening audit

The audit is cheap and the part of this piece you should actually run before anything else. My first pass took the better part of an evening, three to four hours (treat that as directional, not a benchmark; one ran to six because I kept chasing citations into rabbit holes). Quarterly re-runs drop to about an hour. Five steps.

Five-step audit flowchart: pick three queries, run on three LLM panels, log cited sources, read the gap, pick one surface to invest in first — The audit in one picture. Three queries, three engines, log the citations, pick one surface.

Step one: pick three queries you'd want to surface for. Be specific. "Best project management tool for a five-person remote agency" beats "good PM software" because it's closer to how a real user types into a real LLM. One head query (what you'd love to win) and two long-tail queries (what you'd realistically expect to win first).

Step two: run each query on ChatGPT, Perplexity, and Google AI Mode. Capture the full answer plus the cited sources. If an engine doesn't cite by default, prompt for "what sources did you base this on?" Most will hand them over on request.

Step three: log the citations. For each cited source, note which surface it sits on (Reddit thread, YouTube video, podcast episode, Hacker News submission, Stack Overflow answer, brand-owned page, neutral article, competitor page). The pattern across nine answer-source pairs is usually clear within twenty minutes. (Step three takes longer than you expect. Twenty minutes becomes two hours the first time, because you'll want to open every cited thread and read it. Build the time in, and resist that urge on the first pass.)

Here's roughly what the log looked like for the B2B SaaS operator from the top (name and category kept out, so read this as illustrative, not a published case study). The head query was a "best [category] tool for [team size]" prompt. ChatGPT named four tools (the client not among them) and cited three Reddit threads, one YouTube comparison, and one Hacker News thread. Perplexity leaned harder on Reddit; Google AI Mode pulled the YouTube video and a different Reddit thread. Across the nine answer-source pairs the competitor showed up in seven, cited from Reddit four of those times; the client showed up in zero, with not one citation pointing at their own site despite it ranking #1 on Google. The pattern read in under twenty minutes: not a content problem, not a backlink problem. A presence problem, and the surface was Reddit.

Step four: read the gap. Two readings matter. Which surfaces dominate the citation pool for your query type? And which competitors get cited, from which surfaces? If a competitor shows up in seven of nine answers, cited from Reddit four times (the exact pattern above), the gap is Reddit. If they're cited from YouTube, the gap is YouTube. The map up top tells you whether that matches your category's typical pattern.

Step five: pick ONE surface to invest in first. The temptation is all five at once. Don't. Pick where the gap between competitor presence and yours is steepest and consistent with the category pattern. Energy spread thinly across five surfaces returns less than focused energy on one.

If you want a more rigorous version with persistent tracking, the AI visibility audit methodology piece walks the longer-form approach. The one-evening pass is enough for a first read.

Reading the audit and picking the investment

I have not once seen an audit where the gap was evenly spread across five surfaces. It's always one or two, and for founders it's almost always Reddit plus either podcasts or YouTube. What the AI engines reward isn't ranking; it's recurring, varied, third-party mention. A brand discussed naturally across Reddit threads and YouTube comparisons for two years looks completely different to the model than one present only on its own marketing pages.

That framing is the key, and the part I get wrong when I'm impatient. Promotional behavior on any of these surfaces gets penalized harder than absence. The investment that works is earned: real subreddit participation, supporting category-relevant YouTube creators, podcast guest spots backed by actual expertise, Hacker News engagement that's technical opinion rather than a pitch.

Two-column investment-priority table mapping audit results — competitors cited from Reddit, YouTube, podcasts, Hacker News, or Stack Overflow — to the first off-site surface to invest in — The investment-priority decision table. Map your audit result to the first surface to invest in.

Audit result	First investment
Competitors cited from Reddit threads, you're absent	Earned Reddit presence in a few category-relevant subs, sustained over roughly half a year
Competitors cited from YouTube videos	Sponsored mentions on existing category channels OR channel building with monthly cadence
Competitors cited from podcast episodes	Guest appearances on category podcasts, aiming for a steady cadence across a year
Competitors cited from Hacker News submissions	Original technical writing posted to HN by team members with track records
Competitors cited from Stack Overflow answers	Engineering-team time on high-signal threads in your tool's category

Two practical notes from watching operators (and myself) try this. First, the timeline is slow. In the handful of cases I've tracked to anything like completion, a few clients with quarterly re-audits over a year, the shift showed up around month eight or nine. Nothing at month six, real movement closer to twelve, so I'd call "six to twelve months" safer than any point estimate, and even that is my observation rather than a law. (I hate this answer. Six months is a long time to invest with no feedback loop, and the quarterly re-audit is the only useful progress signal I've found.) Anybody promising faster is selling you something. Second, this isn't a tool-purchase decision; it's a team-time decision. A Reddit gap can't be closed by a Reddit tool. It gets closed by a person with category credibility spending a couple of hours a week as a real participant in a few subreddits, which is exactly why it's so hard to outsource.

Where this gets messy

The five-surface map is a snapshot, not a permanent picture. Three things move underneath it.

One, the weights shift as training data shifts. The next model generation might weigh podcasts heavier and Reddit lighter, or lean further on retrieval over parametric memory. The map above is my mid-2026 reading; mid-2027 might read differently, and I won't know until I re-run the audits.

Two, the surfaces themselves shift. The X firewall is recent; Stack Overflow's softening is recent. I'm still calibrating on Bluesky, Threads, and Mastodon: I've run ten or so audits this year and not one surfaced any of the three as a citation source, which is why they're not on the map yet. That could change inside a year, and the audit would catch it.

Three, promotional behavior gets caught faster than the signal accumulates. A brand that hires an agency to seed posts in a dozen subs in a month doesn't see benefit and often sees penalties; Reddit's mod culture, its anti-spam, and the LLM training pipelines all catch the pattern. The investment that works is slower and more boring than any agency pitch will admit.

So the honest version, the one I'd give a founder over coffee: this map will read differently in twelve months. The audit method won't. Re-run it quarterly, update your read, adjust the priority. Don't lock into a multi-year contract with anyone promising "ChatGPT citations." The work that moves the needle is team time, not agency time.

What this doesn't replace

Off-site brand presence is a complement to on-site optimization, not a substitute. The framing that works for me: off-site presence makes the LLM aware of you; on-site quality (schema, clean hierarchy, internal linking, content depth) determines whether that awareness converts to a citation rather than a dismissal. Most operators I hear from have over-invested on-site and under-invested off-site, because the on-site moves feel concrete. The AI-first search optimization piece covers the on-site side; the foundational AI-driven SEO piece covers the joint framing.

If you came out with a surface gap

Two next-action reads, depending on where the audit pointed. If the gap was broad, with competitors cited from three or more surfaces while you're absent across most, read the multi-source brand pickup piece. If it was narrow and concentrated on one surface, go to the deep dives:

Podcast SEO for the podcast path
Video SEO for YouTube for the YouTube path
Brand-in-ChatGPT for the narrower angle
The GEO mentions-in-AI piece for the concept underneath it all

The piece you don't need is the one promising "ChatGPT optimization in 30 days." The audit is the thing. Run it, read the gap, pick one surface, commit a year of focused team time, re-audit in three months. If you want to skip the spreadsheet on the first pass, the SEOJuice AI visibility checker runs the query panel across the LLM engines and logs the cited sources automatically: the manual audit above without the manual part. Find your steepest gap there, then commit the team time, because that's the part no tool does for you.

FAQ

Does posting on Reddit and YouTube actually move the needle for LLM citations?

Yes, but the timeline is slower than most operators want. Six months minimum to register, twelve months to see meaningful shift in citation patterns. The signal that works is recurring, varied, third-party mention — not paid placement. Promotional behavior gets caught and penalized faster than the underlying signal accumulates.

Why don't LLMs cite Twitter or X anymore?

Pre-2023 tweets are in Common Crawl and still influence training data. Post-2023 X content is firewalled — no Common Crawl access, no OpenAI deal, no Google deal as of mid-2026. X-native brand mentions have low surfacing weight in current LLM citations. This can change; X's data could open back up. As of this writing, operators investing in X for LLM discovery are spending against a closed door.

How often should I re-run the audit?

Quarterly. The weights shift slowly; the surfaces shift slowly; re-running monthly is overkill and noise-prone, but yearly misses platform-rise events. A quarterly cadence catches the meaningful shifts without burning team time.

Should I pay to be mentioned on podcasts or YouTube channels?

Yes, if the channel is category-relevant and the placement is a substantive sponsor mention in long-form content. Banner-ad placements and pre-roll ads don't carry citation signal. A sixty-second sponsor read in a podcast where the host genuinely engages with what you do carries signal; a generic ad slot does not. Cost-benefit only works for channels with category audience overlap.

What about Bluesky, Mastodon, and Threads — do those count?

Not yet at meaningful weight in any audit I've run through mid-2026. Track them as potential future surfaces but don't invest first. The audit method will catch a meaningful weight shift if one of these surfaces rises; until then, your team time goes further on the five established surfaces.

Our powerful suite of automation tools for SEO

Learn, discover, and get inspired by our content