Article schema is structured data that tells AI engines what kind of content a page contains. Blog post, how-to guide, news article, or general article. AI recommendation systems use this classification when deciding whether your page is an authoritative expert resource or a product listing, so the right schema type makes that judgment instant and unambiguous.[1]
Add BlogPosting schema (not generic Article) to every content page, combined with Author and FAQPage schema in a single @graph block.
Schema classifies your content before AI reads a single word. The right type tells AI this is dated writing. Not a product page, not a press release, not a service description.
Open your most trafficked blog post and check the source code for a script tag with type="application/ld+json". If it's not there, that page is unclassified to AI.
Article schema is written in JSON-LD format. A lightweight JavaScript notation that lives inside a <script type="application/ld+json"> tag in your page's HTML. It does not appear visibly on the page. Its entire audience is machines: search engine crawlers, AI bots, and any other system that reads your HTML before rendering it.
The schema block contains structured properties that describe the content. The headline, the author, the publication date, the URL, and a brief description. When AI reads your page, it doesn't have to infer these things from the copy. You've handed them over directly, in a machine-readable format it already knows how to parse.[1]
Here's a minimal, valid BlogPosting schema block for an expert node:
{
"@context": "https://schema.org",
"@type": "BlogPosting",
"headline": "What is article schema markup?",
"description": "Article schema classifies your content for AI engines.",
"url": "https://yourdomain.com/your-page",
"datePublished": "2026-03-18",
"dateModified": "2026-03-18",
"author": {
"@type": "Person",
"name": "Your Name",
"url": "https://yoursite.com"
},
"publisher": {
"@type": "Organization",
"name": "Your Business Name",
"url": "https://yoursite.com"
}
}
This is the minimum viable schema for an content page. Adding it to every node in your directory ensures that AI systems can classify and index your content correctly. Without guessing.[2]
Schema.org defines a hierarchy of article types, each with a different signal for AI classification. Understanding the distinctions matters because using the wrong type creates ambiguity. And AI engines dislike ambiguity when they're deciding what to recommend.
| Schema Type | Best For | AI Signal |
|---|---|---|
Article |
General editorial content, opinion pieces, research summaries | Broad. Works but not specific |
BlogPosting |
Expert Q&A nodes, dated knowledge posts, authority directory content | Strong. Signals dated writing |
NewsArticle |
Time-sensitive announcements, industry news, event coverage | Temporal. Deprioritized over time |
TechArticle |
Technical documentation, developer guides, API references | Specialized. Strong for tech audiences |
HowTo |
Step-by-step instructional content with discrete numbered steps | Procedural. Distinct from article types |
For most authority directory nodes. question-based posts answering a specific query. BlogPosting is the correct choice. It signals that this is a specific, dated piece of writing, not a service page or a generic article aggregated from other sources.[3]
Not all schema properties carry equal weight. Some are required for schema to be valid; others are strongly recommended by Google's structured data guidelines; still others add incremental value. For authority nodes, here's the priority stack:
Required for validity:
headline. Must match your H1 exactlyauthor. The Person or Organization who wrote itdatePublished. ISO 8601 format (YYYY-MM-DD)Strongly recommended:
description. A 1–2 sentence summary (can match your meta description)url. The canonical URL of this pagedateModified. Signals freshness to AI enginespublisher. Links the content to your organizationAdditive for authority:
image. An ImageObject with URL, width, and heightmainEntityOfPage. A WebPage reference confirming this is the primary contentwordCount. Signals content depth to AI classification systemsThe author property deserves special attention. A bare string like "author": "Cindy Molchany" is weaker than a full Person object with a URL and sameAs links to your LinkedIn and other verified profiles. The richer the author object, the stronger the E-E-A-T signal.[4]
AI recommendation engines don't just read what your content says. They read what type of content it is before they decide how to use it. This is the distinction that most experts miss entirely.
A page with no schema markup forces AI to make inferences: Is this a product page? A press release? A blog post? A service description? These inferences are slower, less reliable, and often wrong. A page with correct article schema skips the inference step entirely. AI reads the schema, classifies the content in milliseconds, and proceeds to evaluate whether your answer is substantive enough to recommend.
Schema markup is the pre-classification that earns AI's attention. Without it, your excellent content may be read. But it's less likely to be categorized correctly, which means it's less likely to be surfaced in response to the queries it answers.
The practical implication: every content page you publish should have schema on it from day one. Not added as an afterthought after the page has already been indexed without it.
Two tools are the standard for schema validation. Use both.
Google's Rich Results Test (search.google.com/test/rich-results). Paste your page URL or raw HTML and it will show you exactly which schema types were detected, which properties were recognized, and which have errors or warnings. This is the authoritative tool for checking Google's interpretation of your markup.
Schema.org Validator (validator.schema.org). Validates your JSON-LD against the official Schema.org specification. Useful for catching structural errors that Google's tool might overlook.
What to look for when validating:
BlogPosting (or whichever type you used) correctlyRun validation every time you create a new node. One malformed schema block can invalidate all schema on that page.
Every node on this site uses BlogPosting schema. Not the generic Article type. And that's a deliberate decision. BlogPosting is more specific. It says: this is a dated piece of writing, by a named person, published on a specific day, about a specific topic. Generic Article says: this is some kind of article. The specificity matters.
AI engines are pattern-matching systems. When they encounter a page with BlogPosting + Person author + FAQPage schema stacked together, they receive a coherent, reinforcing signal: this is content, it has a verified author, it's structured as Q&A. That combination is rare enough to be notable. Most websites don't have any of these layers, let alone all three.
The Authority Directory Method treats schema markup as infrastructure, not decoration. You wouldn't build a house without wiring it. And you shouldn't publish an page without classifying it. The wiring is invisible to visitors. It's entirely visible to the machines making recommendation decisions.
Article schema is one type of structured data. Structured data is the broader category. It encompasses all schema markup types, including FAQPage, Person, Organization, BreadcrumbList, HowTo, and more. Article schema specifically classifies the content type of a page. Most authority nodes benefit from multiple schema types stacked together in a single @graph block.
Google does not officially confirm that schema markup directly improves rankings. However, proper article schema. Especially when combined with Author schema. Contributes to E-E-A-T signals (Experience, Expertise, Authoritativeness, Trustworthiness), which Google uses to evaluate content quality. More directly relevant for businesses: article schema helps AI engines classify and recommend your content in response to relevant queries.
For a BlogPosting, the minimum recommended properties are: headline (the H1), description (a clear summary), author (with name and url), datePublished, and url. Adding dateModified, publisher, and image strengthens the signal. For authority nodes, combining BlogPosting with Author schema and FAQPage schema in a single @graph block gives the strongest classification signal.
Article schema goes in a script tag with type="application/ld+json" inside the head element of your HTML file. It must be present in the static HTML source. Not injected by JavaScript after the page loads. AI crawlers like GPTBot and Claude-Web do not execute JavaScript, so schema that only appears after JS runs is invisible to them.
Use article schema only on pages that contain article-type content: blog posts, expert guides, opinion pieces, how-to instructional content, and news. Do not apply it to service pages, contact pages, or home pages. Those use different schema types (Service, ContactPage, WebSite/Organization). Misapplying schema sends a confusing classification signal to AI engines.
Take the free AI Visibility Scan to discover your schema gaps and positioning opportunities, or explore the complete build system.