Most websites include metadata for recording a date when a publication (such as a blog, article, or a report) is published or updated. You can index this metadata in your collection to display this information in the search results, use it for sorting, or for filtering.
This how-to-guide details standard metadata fields the Sajari crawler detects and how you can index date metadata in your collection.
...
A website page might have multiple date metadata fields on it, for example, datePublished
, modifiedDate
, & lastupdated
, etc. The crawler indexes date metadata automatically if the best practices are followed and the correct meta fields are used.
...
Note: If you do not provide a datetime
with a timezone we will parse it as UTC.
Property | Expected Type | Description |
---|---|---|
| Date or DateTime | The date on which the CreativeWork was created or the item was added to a DataFeed. |
| Date or DateTime | The date on which the CreativeWork was most recently modified or when the item's entry was modified within a DataFeed. |
| Date | Date of first broadcast/publication. |
...
How to index date metadata
Instructions
Identifying Identify the metadata field that you want to use for sorting resultsindex in your collection.
If the field are is using Open Graph Protocol or Schema.org entities, then skip to step 4.
Add a schema field for the date metadata field via the Schema section and add
data-sj-field="fieldname"
attribute to the metadata (see detailed instructions).Re-index a sample page via the "Diagnose" tool in the Domains section. Once indexed, the record should be updated and the metadata should be added to the field.
Check that the record have has been indexed and has the correct field value. You can check this via the Preview section. Use "Expand all" to display all fields and use filter (e.g.
"filter":"url='http://www.url.com'"
) to check a specific page.Once verified that the metadata is being indexed correctly, re-index all domains in the Domains section.
...
...