SEO Checker + SEO Audit Tool
Crawls all web pages on a specific website and analyzes them from the search engine optimization (SEO) perspective. For example, the actor finds broken links, missing images, and provides information about possible page improvemen...
Google Places Scraper
Extract location details from Google Places which are not provided by Google Maps API like review, photos and popular times.
HTML to PDF Converter
Open a web page in headless Chrome using Puppeteer and print it to PDF. The input is a JSON object and output is a PDF file.
You can use this actor to monitor any page's content and get a notification when content changes. Technically it extracts text by a given selector and compares it with the previous run. If there is any change, it runs another act...
Amazon crawler - this configuration will extract items for keywords that you will specify in the input, and it will automatically extract all pages for the given keyword. You can specify more keywords on the input for one run.
Twitter Hashtag Scraper
This Twitter Hashtag Scraper will scrape and extracts all tweets for given hashtag and provide output in JSON, XML, CSV or HTML.
Crawl for hotels on Booking.com based on search query without API and export data to JSON or CSV. This crawler works best with automatic Apify proxy.
Crawler Cheerio is a ready-made solution for crawling the web using plain HTTP requests to retrieve HTML pages and then parsing and inspecting the HTML using the Cheerio NPM package. Cheerio is a server-side version of the popula...
Kickstarter Search Scraper
Missing Kickstarter API? Need fresh Kickstarter news or list of best and finished projects? Try this new wrapper for Kickstarter search, which allows you to configure search filters and get the list of items from Kickstarter searc...
Contact Information Scraper
Scrape and extract contact information (e-mails, phone numbers, social networks) from any website. Collect or pull and build your own customer database.
PDF to HTML Converter
Easy convert PDF to HTML using the pdf2htmlEX tool.
Extract data about homes from Zillow website without API and download into Excel, CSV or JSON.
Extract data from Transfermarkt website without API and export data to JSON, XML or CSV.
Google Sheets Import & Export
Import data from datasets or crawler executions to your Google spreadsheet. Or even just process the data you already have there!
Google Search Scraper
Crawls Google Search result pages (SERPs) and extracts a list of organic and paid results, ads, snap packs and more. Supports selection of custom country or language, and extraction of custom attributes.
JS Code 2 Flowchart
Google Cheerio Batch
Scrape Google search results in batches. Take a list of URLs as input and save to HTML. It requires GOOGLE_SERP proxy so if you don't have it enabled, contact Apify support
Broken Links Checker
Crawl your website and find broken links. Unlike other similar SEO analysis tools, it also reports broken URL #fragments. The results are stored in a JSON and HTML report.
Crawls a website using one or more sitemaps and imports the data to Algolia search index. The text content is identified using simple CSS selectors. The actor simply runs the algolia-webcrawler NPM package (https://www.npmjs.com/...
Example showing how to use headless Chromium with Puppeteer to open a web page, determine its dimensions, save a screenshot and print the page to PDF. For more information about Puppeteer, please see https://github.com/GoogleChro...
Act sends mail.
Crawler is a ready-made solution for crawling the web using the Chrome browser. It takes away all the work necessary to set up a browser for crawling, controls the browser automatically and produces machine readable results in sev...
Act which takes URL and array of strings to search for and returns a definition of a crawler.
Article Text Extractor
Simply extracts article text and other meta info from given url. Uses https://github.com/ageitgey/node-unfluff which is a NodeJS implementation of https://github.com/grangier/python-goose.
Crawler To Spreadsheet
This crawler takes last crawler run result and stores new items in Google Docs Spreadsheet.
Google Search Results Scraper
Scrape Google Search Results (SERP) with this crawler and export your SEO rankings with this online tool into Excel, JSON or any other format.
Anti Captcha Recaptcha
Act for solving google recaptcha using the anti-captcha.com service. You need to have an anti-captcha subscription to be able to use it.
Url List Download Html
This act accepts a url list and downloads HTML of each page. It has input parameter - "sources" (see soursec parameter of UrlList https://www.apify.com/docs/sdk/apify-runtime-js/beta#RequestList).
Example Hacker News
Example crawler for news.ycombinator.com build using Apify SDK
Linkedin Sign In Example
This act shows how you can sign-in to LinkedIn with Apify act with puppeteer image. It is only example how to do it. Usage: 1. Copy&paste this code to your act. 2. Set up email forwarding for user, that use for sign-in: FROM: ...