google-spreadsheet

lukaskrivka/google-spreadsheet

Import data from datasets or crawler executions to your Google spreadsheet. Or even just process the data you already have there!

Overview

google-spreadsheet is an Apify actor that can be used either to process data already in your spreadsheet or to import new data from Apify datasets or crawler executions. It can be run either on the Apify platform or locally. It is built with the Apify SDK and the apify-google-auth and googleapis npm packages.

Changelog

v0.1 => v0.2

  • Fixed a bug where exceeding the limit of imports per 100 seconds deleted the current data in the sheet. Now the actor fails and doesn't delete anything.
  • Cells are no longer cleared before importing and are trimmed only when needed (which means fewer requests).

v0.2 => v0.3

  • Reworked how append works. It now recalculates and repaints all the cells. If used with filterByField or filterByEquality, it also filters the data already in the spreadsheet. Lastly, transformFunction for append needs to return the whole data that will then be displayed in the sheet. This fixes a problem from v0.2 where old values could sometimes remain in cells that should have been blank after columns were moved.
  • transformFunction now takes an object with newObjects and oldObjects fields instead of separate parameters. This was changed to avoid confusion about which parameter comes first.
  • Added exponential backoff (automatic retries) for Google API calls that return a Service Unavailable error.
  • Added read mode.

v0.3 => v0.4 (latest)

  • Renamed filterByField to deduplicateByField
  • Renamed filterByEquality to deduplicateByEquality
  • Logs are now more descriptive about what is happening
  • transformFunction input object properties were renamed from oldObjects to spreadsheetData and from newObjects to datasetData

Limits

If you exceed these limits, the actor run will fail and no data will be imported.

  • Maximum runs (imports) per 100 seconds: 100

Usage

If you want to run the actor on the Apify platform, open the actor's page in the library and click Try in console, which adds it to your account. When using public actors, you don't need to build them since everything is done by the author; you only need to provide an input and then you can run them. Keep in mind that usage is always charged to the account that runs the actor. You can also let the actor run automatically after every crawler run.

If, on the other hand, you want to run the actor locally, open the actor's GitHub page and clone it to your computer.

Authentication and authorization

If you use this actor for the first time, you have to log in with the same Google account where the spreadsheet is located, authorize, and allow Apify to work with your spreadsheets. Internally we use our small npm package apify-google-auth. Please check this article on how to authorize.

After you authorize for the first time, tokens are stored in your key-value store (see the tokensStore option, which defaults to google-oauth-tokens) and you don't need to authorize again. So after the first usage, you can fully automate the actor.

If you want to use more than one Google account inside one Apify account or locally, each Google account needs its own tokensStore, and you need to track which tokens belong to which account by naming the stores properly.
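
For example, tokens for a second Google account can be kept apart simply by pointing the run to a differently named store. The store name and IDs below are only illustrative placeholders; the remaining options are described in the Input section:

{
    "mode": "append",
    "spreadsheetId": "<SPREADSHEET_OWNED_BY_THE_SECOND_ACCOUNT>",
    "datasetOrExecutionId": "<DATASET_ID>",
    "tokensStore": "google-oauth-tokens-second-account"
}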

Modes

This actor can be run in several different modes. Each run uses exactly one mode. The mode also affects how other options work (details are explained with the specific options).

  • replace: If there are any old data in the sheet, they are all cleared and then the new data are imported.
  • append: This mode adds new data as additional rows below the rows already present in the sheet. Keep in mind that the columns are recalculated, so some of them may move to different cells if new columns are added in the middle.
  • modify: This mode doesn't import anything. It only loads the data from your sheets and applies any of the processing you set in the options.
  • read: This mode simply loads the data from the spreadsheet, optionally processes them, and saves them as an OUTPUT JSON file to the default key-value store (see the example after this list).
  • load backup: This mode simply loads backed-up rows from a previous run (see the createBackup option for details) and imports them to a sheet in replace style.
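
For example, after a run in read mode finishes, the processed data can be downloaded from the run's default key-value store through the Apify API. A minimal sketch in JavaScript, assuming Node.js 18+ (global fetch and top-level await) and a placeholder store ID taken from the run detail:

// Download the OUTPUT record that a read-mode run saved to its default key-value store
const storeId = '<DEFAULT_KEY_VALUE_STORE_ID>'; // placeholder - copy it from the run detail
const response = await fetch(`https://api.apify.com/v2/key-value-stores/${storeId}/records/OUTPUT`);
const rows = await response.json(); // an array of row-objects read from the spreadsheet
console.log(`Loaded ${rows.length} rows`);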

Inner workings

Important! - The maximum number of cells in the whole spreadsheet is 2 million! If the actor would need to import data that exceeds this limit, it throws an error, finishes, and imports nothing.

Important! - No matter which mode you choose, the actor recalculates how the data should be positioned in the sheet, updates all the cells, and then trims the superfluous rows and columns. There are two main reasons for this. The first is to be maximally efficient with the number of rows and columns, so any unused rows/columns are trimmed off. The second is that if the new data have new fields (e.g. bigger arrays), columns may need to be inserted in the middle of the current columns, so everything needs to be recalculated and moved.
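
For illustration (with hypothetical columns): if the sheet currently has the columns sku, sizes/0, sizes/1 and price, and the new data contain items with a sizes/2 field, the new column is inserted between sizes/1 and price, so the price column shifts one position to the right:

Current columns:  sku | sizes/0 | sizes/1 | price
After the import: sku | sizes/0 | sizes/1 | sizes/2 | price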

Input

Most Apify actors require a JSON input and this one is no exception. The input consists of one object with multiple options (an example input follows the list of options):

  • options <object>
    • mode <string> Any of replace, append, modify, read, load backup. Explained above. Required
    • spreadsheetId <string> Id of your spreadsheet. It is the long hash in your spreadsheet URL. Required
    • datasetOrExecutionId <string> Id of the dataset or crawler execution where the data you want to import are located. This option is mandatory for replace and append modes and not usable in other modes.
    • backupStore <string> Id of the store where the previous backup was saved. It is the id of the default key-value store of the run from which you want to load the backup. This option is mandatory for "load backup" mode and not usable in other modes.
    • limit <number> Defines how many items (rows) you want to import. Default: Maximum (currently 250000).
    • offset <number> Defines how many items you want to skip from the beginning. Default: 0.
    • range <string> Defines which part of your spreadsheet will be impacted by the actor. It is specified in A1 notation. Default: Name of the first sheet in the spreadsheet.
    • tokensStore <string> Defines in which key-value store the authorization tokens are stored. This applies both to where they are initially stored and to where they are loaded from on each subsequent run. Default: "google-oauth-tokens".
    • deduplicateByEquality <boolean> If true, only unique items (rows) are imported. Two items are considered duplicates if all of their fields are equal (deep equality). Only one of deduplicateByEquality, deduplicateByField and transformFunction can be specified! Default: false.
    • deduplicateByField <string> Similar to deduplicateByEquality, but uniqueness is checked only by the one specified field, which means the rest of the fields may differ but the item will still not be imported. Only one of deduplicateByEquality, deduplicateByField and transformFunction can be specified! Default: null.
    • transformFunction <string> Custom function that can filter or modify the items in any way. Its requirements and behaviour differ for each mode. Only one of deduplicateByEquality, deduplicateByField and transformFunction can be specified! Default: null.
    • createBackup <boolean> If true, then after obtaining the data from the spreadsheet and before any manipulation, the data are stored in the default key-value store under the key backup. It can be loaded in a future run using the load backup mode. Useful when you are not sure what you are doing and already have valuable data in the spreadsheet. Default: false.
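
For example, an input that appends at most 1000 new rows to the sheet named Sheet1, skips rows whose sku already exists and backs up the current content first could look like this (the IDs are placeholders):

{
    "mode": "append",
    "spreadsheetId": "<YOUR_SPREADSHEET_ID>",
    "datasetOrExecutionId": "<YOUR_DATASET_OR_EXECUTION_ID>",
    "limit": 1000,
    "range": "Sheet1",
    "deduplicateByField": "sku",
    "createBackup": true
}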

Deduplicate options and transform function

By default the behaviour of the import is straightforward: replace mode simply replaces the old content with new rows, append simply adds new rows below the old ones, modify doesn't do anything (it is only useful together with the deduplicate options or a transform function) and read saves the data as they are to the key-value store. For more complicated imports that require importing only unique items or any other custom functionality, you need to use one of the following options: deduplicateByField, deduplicateByEquality or transformFunction. The behaviour of each of these options is specific to each mode, so if you need a more complicated workflow it is important to understand how they interact.

  • deduplicateByEquality: Only unique items (rows) are kept in the data. If two items have all fields the same, they are considered duplicates and are removed from the data.
  • deduplicateByField: Similar to deduplicateByEquality, but the uniqueness of items is compared only by one field. So if one item has a certain value in this field, all other items with the same value are considered duplicates and are removed from the data (a JavaScript sketch of this logic follows the list).
    • append: Old and new data are put together and checked for duplicates. Only the first item is kept if duplicates are found.
    • replace: Works like append but cares only about the new data.
    • modify: Works like replace but cares only about the old data.
    • read: Works the same as modify.
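
To make the deduplication concrete, here is a rough JavaScript sketch of the equivalent filtering logic for deduplicateByField (an illustration only, not the actor's actual implementation):

// Keep only the first item for each value of the chosen field
const deduplicateByField = (items, field) => {
    const seen = new Set();
    return items.filter((item) => {
        if (seen.has(item[field])) return false; // later duplicates are dropped
        seen.add(item[field]);
        return true; // the first item with this value is kept
    });
};

// In append mode, old and new data are checked together, e.g.:
// const finalData = deduplicateByField(spreadsheetData.concat(datasetData), 'sku');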

Transform function

If you need more complicated filtering abilities or just want to do whatever you want with the data, you can use the transformFunction option. You should provide a stringified JavaScript function that gets the data as a parameter and returns the transformed data. The data format is very similar to the JSON format of the datasets or crawler results, except that all nested objects (objects and arrays) are flattened. It is basically an array of objects (items) with flattened fields; let's call it row-object format.

[{
    "sku": "234234234",
    "country": "US",
    "price": 20.99,
    "sizes/0": "S",
    "sizes/1": "M",
    "sizes/2": "L",
    "sizes/3": "XL"
},
{
    "sku": "123123123",
    "country": "UK",
    "price": 48.49,
    "sizes/0": "M",
    "sizes/2": "XL"
}]
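
For illustration, the first item above could come from a nested dataset item like this before flattening (the original nested shape is assumed here):

{
    "sku": "234234234",
    "country": "US",
    "price": 20.99,
    "sizes": ["S", "M", "L", "XL"]
}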

The function should always return an array in the row-object format, which will first be converted to the rows format and then imported to the sheet. The parameters the function receives differ based on the mode:

  • append: The function receives an object with spreadsheetData and datasetData properties as its parameter. spreadsheetData is a row-object array of the data you already have in the spreadsheet. datasetData is a row-object array of the items from the dataset or crawler execution.
  • replace: The function receives an object with a datasetData property as its parameter. It is a row-object array of the items from the dataset or crawler execution.
  • modify: The function receives an object with a spreadsheetData property as its parameter. It is a row-object array of the data you already have in the spreadsheet.
  • read: Works the same as modify.

Example of usage with append mode (let's imagine we always want only the cheapest product for each country):

({ datasetData, spreadsheetData }) => {
    // First we put the data together into one array
    const allData = datasetData.concat(spreadsheetData);

    // We define an object that will hold a state about which item is the cheapest for each country
    const stateObject = {};

    // Now let's loop over the data and update the object
    allData.forEach((item) => {
        // If the item doesn't have price or country field, we will throw it away
        if (!item.price || !item.country) return;

        // If the state doesn't hold the country yet, store this item as the current cheapest for that country
        if (!stateObject[item.country]) {
            stateObject[item.country] = item;
        } else if (item.price < stateObject[item.country].price) {
            // If the state already holds the country, let's compare whether the new item is cheaper than the old one and if so, replace it
            stateObject[item.country] = item;
        }
    });

    // Once we have gone through all the items, convert our state object back to the right array format
    const finalData = Object.values(stateObject);
    return finalData;
}
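
Because the actor's input is JSON, the transform function has to be passed as a string in the transformFunction field. For example, a modify run that keeps only the rows with a price above 20 could look like this (the spreadsheet ID is a placeholder):

{
    "mode": "modify",
    "spreadsheetId": "<YOUR_SPREADSHEET_ID>",
    "transformFunction": "({ spreadsheetData }) => spreadsheetData.filter((item) => item.price > 20)"
}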

Crawler webhook

If you are using Apify crawlers, you can add a finish webhook to the crawler so that data will be imported to the spreadsheet each time a crawler run finishes. You need to run this actor manually the first time so that you properly authenticate and authorize.

To set up the webhook, go to the advanced settings of your crawler, find the Finish webhook URL field and paste in the API URL that runs the actor. Use the URL below and simply change the token to your Apify API token, which you can find in your Account => Integrations tab.

https://api.apify.com/v2/acts/lukaskrivka~google-spreadsheet/runs?token=<YOUR_API_TOKEN>

Then you need to add an input for the actor to the Finish webhook data. It should be a standard JSON input as specified above, just without the datasetOrExecutionId field, because it is automatically extracted from the webhook. Simple Finish webhook data can look like this:

{"mode": "append", "spreadsheetId": "1oz8YzfE5gVw84qWAywlugH0ddrkc1FyAe1aEO4TFTjA"}