User

Marek Trunkátmtrunkat

Full Stack Web Developer and Technology Enthusiast

All
Popularity
Actor

crawler-to-spreadsheet

mtrunkat/crawler-to-spreadsheet

This crawler takes last crawler run result and stores new items in Google Docs Spreadsheet.

avatarmtrunkat
54star
Actor

article-text-extractor

mtrunkat/article-text-extractor

Simply extracts article text and other meta info from given url. Uses https://github.com/ageitgey/node-unfluff which is a NodeJS implementation of https://github.com/grangier/python-goose.

avatarmtrunkat
52star
Crawler

Aliexpress.com - own orders

Get all your orders from aliexpress.com in machine readable format.

avatarmtrunkat
36cloud_download
Actor

example-hacker-news

mtrunkat/example-hacker-news

Example crawler for news.ycombinator.com build using Apify SDK

avatarmtrunkat
28star
Actor

twitter

mtrunkat/twitter

Extracts all tweets for given hashtag.

avatarmtrunkat
28star
Actor

url-list-download-html

mtrunkat/url-list-download-html

This act accepts a url list and downloads HTML of each page. It has input parameter - "sources" (see soursec parameter of UrlList https://www.apify.com/docs/sdk/apify-runtime-js/beta#RequestList).

avatarmtrunkat
21star
Actor

crawl-url-list-1by1

mtrunkat/crawl-url-list-1by1

Crawls given list of urls with one crawler execution per url.

avatarmtrunkat
19star
Crawler

Skoda-auto.cz - model variants

Get all model-engine-equipment package variants of Škoda Auto cars.

avatarmtrunkat
13cloud_download
Crawler

HN Show

Scrapes the links with their rank from HN Show. Created for this blogpost https://medium.com/p/8cccfa25f5cb/edit

avatarmtrunkat
6cloud_download
Actor

crawler-timeline

mtrunkat/crawler-timeline

This act creates a timeline spreadsheet from crawler results. Main use-case is to create a spreadsheet containing changes of some web page in time.

avatarmtrunkat
6star
Actor

puppeteer-promise-pool-example

mtrunkat/puppeteer-promise-pool-example

Example how to use Puppeteer in parallel using 'es6-promise-pool' npm package.

avatarmtrunkat
4star
Actor

xmls-to-dataset

mtrunkat/xmls-to-dataset

This act loads list of urls from INPUT.sources. Each of these links should point to a xml file. It downloads all the files and saves them to it's default dataset. Groups parameter in INPUT allows to choose Apify proxy groups to us...

avatarmtrunkat
2star
Actor

proxy-test

mtrunkat/proxy-test

This actor simply tests given array of URLs against selected proxy URLs or Apify proxy groups.

avatarmtrunkat
2star
Actor

24-hour-stats

mtrunkat/24-hour-stats

This act can be used as synchronous API. Returns a JSON containing actor runs finished in the last 24 hours along with information about their default datasets and request queues. Actors might be filtered via input array "actIds".

avatarmtrunkat
2star
Actor

delete-untitled-acts

mtrunkat/delete-untitled-acts

Deletes all untitled acts from your account. In a minute. For free. With one click!

avatarmtrunkat
1star
Actor

crawler-to-sitemap

mtrunkat/crawler-to-sitemap

This act can be used as crawler's finish webhook. It transforms crawler's result into sitemap XML file and stores it in key-value-store named "sitemaps".

avatarmtrunkat
1star