
awswrangler_pandas_dataframe_to_s3_parquet_read_write.py

python

This quickstart demonstrates how to write a pandas DataFrame to an S3 bucket as a Parquet dataset with awswrangler, and how to read it back.

import awswrangler as wr
import pandas as pd

# Create the DataFrame
df = pd.DataFrame({"id": [1, 2], "value": ["foo", "bar"]})

# Write to S3 as a Parquet dataset
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",  # Optional: Glue Catalog database; omit to write to S3 only
    table="my_table",  # Optional: Glue Catalog table; omit to write to S3 only
)

# Read the dataset back from S3
df = wr.s3.read_parquet(path="s3://bucket/dataset/", dataset=True)
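
Because database and table are optional, it can help to assemble the arguments in one place before calling wr.s3.to_parquet. The following is a minimal sketch, not part of awswrangler: build_to_parquet_kwargs and write_dataframe are hypothetical helper names, and the rule that database and table should be passed together is an assumption about how the library validates its arguments.

```python
from typing import Any, Dict, Optional


def build_to_parquet_kwargs(
    path: str,
    database: Optional[str] = None,
    table: Optional[str] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: assemble keyword arguments for wr.s3.to_parquet."""
    if not path.startswith("s3://"):
        raise ValueError(f"expected an s3:// URI, got {path!r}")
    # Assumption: the catalog arguments only make sense as a pair.
    if (database is None) != (table is None):
        raise ValueError("pass both database and table, or neither")

    kwargs: Dict[str, Any] = {"path": path, "dataset": True}
    if database is not None:
        # Register the dataset in the Glue Catalog in addition to writing to S3.
        kwargs["database"] = database
        kwargs["table"] = table
    return kwargs


def write_dataframe(df, **options) -> None:
    """Write df with awswrangler; needs AWS credentials and a real bucket."""
    import awswrangler as wr  # imported lazily so the helper above works offline

    wr.s3.to_parquet(df=df, **build_to_parquet_kwargs(**options))
```

Calling write_dataframe(df, path="s3://bucket/dataset/") would write to S3 only, while adding database="my_db" and table="my_table" would also pass the catalog arguments through.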