In addition to offering support for the JavaScript deployment (using highlight.js) Schema App supports using its web crawler to generate the markup. The Crawler should be used when one is interested in no impact to page speed and can manage markup updating less frequently. 


The setup for the AEM connector only differs in the configuration you choose between Crawler and JavaScript. The operation of the Schema App connector only differs in that it does not include the highlight.js script on the page and will not request highlight.js produced markup from Schema App's CDN. The flow of the integration is shown below. 



The Crawler will run on a schedule typically once a week where it will start and crawl your site. It will use a combination of the sitemap and links extracted from the pages. If no sitemap is present it will start on the homepage and pull links to reach the rest of the site. It attempts to match templates created in the Schema App Highlighter as it parses each page. If a template is matched Schema Markup is produced and sent to Schema App servers where it is converted to JSON-LD. The JSON-LD is then stored on S3 that is attached to a CloudFront CDN. The Schema App connector in AEM will send requests on a schedule to the CDN and pull markup that is produced by the crawler, if the markup has changed it will be updated in the page and pushed to out the dispatcher.