User

Jakub Baladajakubbalada

I'm a co-founder of Apify, father of 2 kids, web hacker and beer lover.

All
Popularity
Actor

content-checker

jakubbalada/content-checker

You can use this act to monitor any page's content and get a notification when content changes. Technically it extracts text by given selector and compares it with the previous run. If there is any change, it runs another act to ...

avatarjakubbalada
9star
FEATURED
Crawler

Google Search

Scrapes data from Google search for given keyword (organic, ads, snackpacks).

avatarjakubbalada
888cloud_download
Crawler

Email and Social handlers extractor

Get emails and social handlers (Twitter, LinkedIn, Instagram) from page/domain/web. Just change the Start url and define the scope.

avatarjakubbalada
561cloud_download
Crawler

transfermarkt.com

Get info about your favorite soccer players from transfermarkt.com

avatarjakubbalada
353cloud_download
Crawler

Booking - hotel details

Get info about hotels on booking.com based on search query.

avatarjakubbalada
261cloud_download
Crawler

Complete HTML

Crawls entire site (www subdomain) and extracts complete HTML content for every page

avatarjakubbalada
206cloud_download
Crawler

Booking - hotel prices

Get prices for your favorite hotel on booking.com. Scrapes all available rooms with description and prices for given hotel and dates.

avatarjakubbalada
186cloud_download
Crawler

Google Play store - app reviews

Get app reviews from Google Play store (max. 4000 reviews). Uses internal AJAX call which returns 40 reviews in html code.

avatarjakubbalada
120cloud_download
Crawler

yellowpages.com

Scrapes basic info for given keyword and location (from a list)

avatarjakubbalada
84cloud_download
Crawler

Louis Vuitton

Get product data from e-commerce site

avatarjakubbalada
71cloud_download
Crawler

yelp.com with reviews

Get basic info and all reviews from Yelp

avatarjakubbalada
67cloud_download
Crawler

IMDB.com

Get info about movies from IMDB (from detail page)

avatarjakubbalada
54cloud_download
Crawler

Basic SEO

Crawler for basic SEO analysis.

avatarjakubbalada
53cloud_download
Crawler

6pm.com - JS variable

Get products information from e-commerce fashion site using JavaScript variable available on a page

avatarjakubbalada
48cloud_download
Crawler

yelp.com reviews from JSON-LD

Crawler takes biz id from customData attribute and scrapes all reviews using JSON Linked data. If the page is not loaded (proxy can be banned), it is enqueued again.

avatarjakubbalada
46cloud_download
Crawler

XML parser

Crawler gets all categories as a set from given xml feed with products.

avatarjakubbalada
30cloud_download
Crawler

prisjakt.nu

Get product prices from prisjakt.nu

avatarjakubbalada
24cloud_download
Crawler

Hubspot.com

Get prospects from Hubspot.com behind your login which is handled by submitting login form.

avatarjakubbalada
22cloud_download
Crawler

booli.se

Get all real estate offers from booli.se using internal JS variable

avatarjakubbalada
20cloud_download
Crawler

Readability

Get text from a page using readability.js

avatarjakubbalada
19cloud_download
Crawler

Hacker News

Get top HN submissions (data is taken from the list on the frontpage)

avatarjakubbalada
17cloud_download
Crawler

Audience Demographics from Alexa.com

Get audience demographics for given site from Alexa. Data are extracted from bars using their width attribute

avatarjakubbalada
17cloud_download
Crawler

StartupJobs.cz offers

Get all job offers from startupjobs.cz (data taken from a list)

avatarjakubbalada
15cloud_download
Crawler

kirnazabete.com

Get all new products from e-commerce site. Uses internal JS variable for some attributes.

avatarjakubbalada
14cloud_download
Crawler

Startups from Geekwire v1

Get all startups based in the Pacific Northwest from Geekwire collection. Crawler navigates through pagination in one page function using JS click().

avatarjakubbalada
14cloud_download
Crawler

topshop.com using XHRs

Get all products from topshop.com category using their internal XHRs

avatarjakubbalada
14cloud_download
Crawler

login to comicsdb.cz

Login example (comicsdb.cz). Simple POST data in Start url doesn't work, Pseudo-url has to be used to handle new request after login.

avatarjakubbalada
13cloud_download
Crawler

SFO Flights

Get departures and arrivals at SFO. Pagination is handled in Page function using click()

avatarjakubbalada
13cloud_download
Crawler

Techcrunch.com

Outputs links to articles containing specific keyword (and a few words around the keyword)

avatarjakubbalada
12cloud_download
Crawler

blibli.com

Get product reviews from Blibli.com using internal JS variable and AJAX calls

avatarjakubbalada
12cloud_download