[api, application programming interface, read, write, export, csv, json]


Overview

After you have requested data from an API and extracted the required fields, you often want to convert the data into a format that is compatible with other software programs (e.g., Excel and R). A popular file format for tabular data is a Comma Separated Value (CSV) because it is simple, widespread, and compatible with most platforms. CSV files are file formats that contain plain text values separated by commas and can be opened by almost any spreadsheet program. In other cases, you may solely want to store the raw JavaScript Object Notation (JSON) data as a backup (simply because you don’t know exactly what fields you’ll need).

Code

Write Data to CSV

To facilitate writing to a CSV file, we’ll make use of the csv library. If you want to add to an existing CSV file - rather than overwriting it - use the a flag (append) instead of the w flag (write).

import csv
with open("<FILENAME>.csv", "w") as csv_file:
    writer = csv.writer(csv_file, delimiter = ";")
    writer.writerow(["<COLUMN_1>", "<COLUMN_2>"])
    for content in all_data:
        writer.writerow([<DATAFRAME>["<COLUMN_1>"], <DATAFRAME>["<COLUMN_2>"])
Tip

If your data is already stored as a pandas DataFrame you can easily export it as follows: <DATAFRAME>.to_csv("<FILENAME>.csv", index=False).

Read CSV Data

The read.csv() method from the pandas library automatically converts the data into a DataFrame which provides rich functionalities for data analysis.

import pandas as pd
df = pd.read_csv("<FILENAME>.csv", sep=";")

Tip

If you encounter CSV files with a custom delimiter (i.e., a symbol used to separate the data into rows and columns), you can explicitly indicate that with the sep parameter. For example, in this case, the interpreter expects that data fields have been separated by semi-colons (;).

Write Data to JSON

The json packages make exporting raw JSON data (JSON_FILE) straightforward:

import json
with open("<NAME_OF_JSON_EXPORT>.json", "w") as outfile:
    json.dump(<JSON_FILE>, outfile)

Read JSON Data

Similarly, you can import the JSON files with the same library.

import json
with open("<NAME_OF_JSON_EXPORT>.json") as f:
    data = json.load(f)