We sample which Reddit threads actually appear as sources in AI answers for our clients' categories. The pattern is consistent enough to teach. This is the teardown — the elements that separate a cited thread from an invisible one, in the order an AI engine encounters them.
Element 1: the title is the query
An AI engine answering "best Korean sunscreen for oily skin" is looking for a source that answers exactly that. A thread titled "Tried 6 Korean sunscreens for oily skin over 3 months — ranked" mirrors the query almost word for word. A thread titled "My honest [Brand] review!!" does not, and on top of that reads as promotional, which gets it removed by moderators and down-weighted by Claude.
The rule: the title should be the question a buyer would type, not a statement about your brand. Topic-first, brand-absent, specific. This single element is the highest-leverage decision in the whole thread, because retrieval starts with title-to-query matching.
Element 2: the opening line names the entities
AI engines extract entities — products, brands, places — from the first paragraph to decide what the thread is about. A cited thread opens by naming them: "I tested Beauty of Joseon, Round Lab, Skin1004, and three others." A skipped thread buries the entities under a personal preamble ("So I've struggled with oily skin my whole life and finally..."). By the time the products appear, the extraction window has passed.
The rule: name the products, brands, or places in the first one or two sentences. The model reads the top of the thread most heavily.
Element 3: concrete numbers in the body
Specificity is what Perplexity over-cites. "$24 vs $32," "47 days of use," "SPF 50+ PA++++," "ran one size small." These give the model quotable, checkable facts. Vague praise — "amazing," "holy grail," "10/10" — gives it nothing to lift. A thread dense with specifics is far more citable than a thread of enthusiasm.
| Cited body | Skipped body |
|---|---|
| "Used it 6 weeks; no breakouts; $19 for 50ml" | "Honestly obsessed, my skin has never looked better" |
| "Compared to [X], lighter finish, no white cast" | "Way better than the other ones I tried" |
| "Reapplied at hour 4, still controlled oil" | "Lasts a really long time" |
Element 4: parseable structure
A model should be able to answer the user's question from the thread even if it reads only the top comment. Cited threads use ranked lists, side-by-side comparisons, and a summarizing line near the top. Skipped threads are walls of unbroken text where the answer is buried in paragraph four. Structure is not cosmetic — it is what lets the engine extract a clean answer.
Element 5: the comments stress-test the claim
This is the element brands underrate. A thread with 30 thoughtful comments — some agreeing, some pushing back, some adding data — is more citable than a thread with 300 one-word "same!" replies. The disagreement is a feature: it signals to the model (and to humans) that the claim survived scrutiny. Claude in particular favors threads with sustained, substantive discussion over viral spikes. A contested-but-defended claim beats unanimous praise.
Element 6: sustained engagement over time
A thread with 30 comments spread over six months beats a thread with 300 comments in 24 hours then silence. Time-distributed engagement signals durable relevance, which is why aged, maintained threads accrue citation value — especially in Claude. This is also why Reddit GEO is a compounding asset, not a campaign.
The teardown, side by side
| Element | Cited thread | Skipped thread |
|---|---|---|
| Title | Topic-first, mirrors query | Brand-first, promotional |
| Opening | Entities named in line 1 | Personal preamble |
| Body | Numbers, timeframes, prices | Vague enthusiasm |
| Structure | Lists, comparison, summary | Wall of text |
| Comments | Substantive, some pushback | One-word agreement |
| Engagement | Sustained over months | Viral spike then silence |
None of this is about gaming the system. The threads AI engines cite are the threads humans find most useful — specific, honest, stress-tested. The anatomy is just what "useful" looks like when you write it down. The full method that produces these is in the Reddit GEO playbook, and the per-engine retrieval logic is in how ChatGPT chooses Reddit threads.
FAQ
What's the single most important element?
Topic-first, brand-absent, specific. Get this wrong and the rest barely matters because retrieval starts here.
Do downvotes or pushback hurt citation chances?
A claim that survived scrutiny reads as more credible to both humans and models than unanimous praise. What hurts is a thread that's net-negative and collapsed.
Can a brand write these threads directly?
The anatomy fails if the account is cold or the post reads as an ad. See why brands get banned on Reddit.
How long until a well-built thread gets cited?
Aged threads keep accruing value, especially in Claude.
Want this run for your brand?
Upvote runs Reddit end to end for Korean brands entering the US — reputation, community, Reddit Ads, and GEO measured weekly across ChatGPT, Perplexity, Claude, and Gemini.
Work with us →