Scraping

Historial
  • Extract data from SSR sites (React, Next.js, Remix, etc) (example).
  • Some websites will have JSON-LD embedded for search engines to scrap. You can, too! <script type="application/ld+json">
  • Extract sitemaps to get a list of pages to scrap.
  • Extract <meta> tags for basic and not-so-basic metadata.
  • Inject code into React apps (monkeypatching or using the browser debugger) and send the data to your own API server. Demo