How do AI engines (ChatGPT, Perplexity, Gemini) decide which brands to cite?
AI engines decide which brands to cite by combining three signals: how cleanly they can identify the brand as a real entity, how often that entity is mentioned in sources the engine already trusts, and how well the brand's own site reinforces the entity with structured data and consistent naming. There is no ranking position. The engine reads dozens of sources, synthesises an answer, and includes the brands whose signals are strongest across those three axes.
Signal 1: Entity recognition
The first question the engine answers internally is "is this a real, distinct thing?" If the brand name is ambiguous, generic, or absent from the engine's training data and live retrieval set, the brand gets skipped even when its content is good. This is why structured data matters.
Concrete fixes that move this signal:
- Organization schema on the homepage with consistent
name,url,logo, andsameAsentries pointing to LinkedIn, Crunchbase, and any directory listings. - Product or Service schema on category pages with clear naming.
- A Wikipedia-style "about" page on the site that defines the brand in the same words the client uses elsewhere.
- Consistent brand naming across all directories. If the client is "Acme Corp" on G2 and "Acme Inc." on Capterra and "Acme" on LinkedIn, the engine sees three weak entities instead of one strong one.
Signal 2: Citation density in trusted sources
Engines do not weight all sources equally. A mention in a Forbes Advisor listicle, a G2 category page, or a respected trade publication carries more weight than a mention on a low-traffic blog. This is the part that looks most like classic SEO but plays differently: raw backlink volume does not matter. Named mentions in specific source types do.
For most agency clients, the source set worth targeting includes:
- Category-defining review sites in their vertical (G2, Capterra, TrustRadius for SaaS; Houzz for home services; Clutch for agency services)
- Listicle and comparison content on respected trade publications
- Analyst-style write-ups from independent commentators with established LinkedIn or Substack followings
- Wikipedia, where the entity qualifies for an entry
This is where agency PR and earned-media work starts to overlap directly with GEO. A single Forbes Advisor placement that names the client by name in a shortlist can shift citation behaviour across all five engines within a few weeks.
Signal 3: On-site reinforcement
The third signal is whether the brand's own site supports the engine in summarising what the brand does. Pages that are easy to summarise get cited; pages that bury the answer get skipped.
What "easy to summarise" looks like:
- Clear H1 that states what the page is about in the same language a buyer uses
- A TL;DR or summary paragraph in the first 100 words
- Specific named offerings and pricing where appropriate
- FAQ schema on pages that answer buyer questions directly
- Internal linking that signals topical depth (the brand has not just one page about X but a cluster)
A concrete example
An agency runs the prompt What is the best landing-page builder for a marketing agency? on ChatGPT. The engine cites Unbounce, Instapage, and Webflow. The agency's client, a similar tool called Acme Pages, is absent. The agency does not need to guess why. A scan shows:
- Acme Pages has Organization schema but no Product schema, so the engine sees a company but not a clearly named product entity.
- Acme Pages is not on G2's "landing page builder" category page.
- The most recent listicle covering this category was on a marketing-ops trade publication; Acme Pages is mentioned in a footnote on that page but not in the comparison body.
- Acme's homepage describes the product as "a conversion platform for growth teams", while G2 and the listicles consistently call this category "landing-page builders".
None of those fixes are mysterious. Add Product schema. Submit to G2's category. Pitch a follow-up comparison piece to the trade publication. Rewrite the homepage hero to use the buyer's category language. Within 60-90 days, the same prompt usually starts naming Acme.
What does not move the needle
- Adding more blog posts on the brand's own site, by itself. Engines weight third-party mentions higher than self-publication.
- Stuffing schema with keywords. Schema is for identity, not ranking.
- Buying low-quality backlinks. AI engines mostly ignore these and so does Google.
- Optimising for a single engine. The same upstream signals power all five, with engine-specific weighting on top.