Apify for developers

Apify is a web scraping and automation platform. It provides a number of services to help you turn any website into an API.

  Crawler

Recursively crawl large websites and extract data from them using a few simple lines of JavaScript.



  • code
    Full JavaScript power

    Tell the crawler what to do using a few simple lines of JavaScript with jQuery, Underscore.js or any other client-side library. Your code runs in a browser hosted on our servers.

  • event
    Navigates arbitrary websites

    Crawl and extract data from websites that employ AJAX, complex pagination or infinite scroll using the same tools you already use for your front-end development.

  • storage
    Entirely in the cloud

    No need to install browser plugins, download any tools, set up cron jobs, databases or proxy servers. You already have all you need - your current web browser.

settings  Actor

Actor is a serverless computing platform that enables creation of custom web scraping and automation jobs. A single Actor job is called an act.



  • Use headless Chrome and Puppeteer

    Apify provides a first-class support for running headless Chrome, either using Selenium WebDriver or Puppeteer. You only need to select the appropriate base image for your act and then call launchPuppeteer() or launchWebDriver() function provided by the apify NPM package.

  • Host your code on GitHub

    The source code of your acts can be hosted directly on Apify, as well as on GitHub, Docker Hub or in fact anywhere on the internet. Your act can be automatically built and published on every commit to Git using a simple webhook. It only takes a few clicks to move your existing project to Apify and publish it in the library for anyone to use it!

  • Go beyond traditional serverless platforms

    Unlike traditional serverless computing platforms, the run of an act is not time-limited. Acts can run for as long as necessary, even forever. The act can perform anything from a simple action, such as sending an email or cropping an image, to complex operations such as crawling an entire website and removing duplicates from a large dataset.

flare  Additional services

There is more.



  • event
    Scheduler

    Schedule your crawlers or acts to run periodically every day, hour, minute or even every second using a cron-like expression.

  • storage
    Storage

    Apify provides a simple key-value store to help you quickly store and retrieve data from your acts.

  • device_hub
    Integrations

    Apify integrates seamlessly with Zapier, Keboola, Integromat or in fact any other service thanks to the REST API.

Become
an Apify expert

Every day we get dozens of requests from people all over the world to set up web automation solutions for them. Would you like to help us with that and make some money? Please let us know at jobs@apify.com

Apify for business

Apify helps companies succeed by leveraging unlimited data from the web. We'll find an expert to set you up.

Learn more