Parquet

Stream data to Parquet files on S3 for efficient storage and analytics. Parquet is a column-oriented file format optimized for big data processing.

Stream Data to Parquet Files

Key Features

  • Columnar Storage - Efficient compression and encoding for analytics workloads
  • S3 Integration - Write directly to your S3 bucket
  • Schema Evolution - Handle schema changes automatically
  • Partitioning - Organize data by time or custom partitions

How It Works

Streamkap converts change data capture (CDC) events to Parquet format and writes them to S3:

  1. Changes are captured from your source database
  2. Events are batched and converted to Parquet format
  3. Files are written to your S3 bucket, organized by partition
  4. Data is ready for analytics with tools like Spark, Athena, or Presto
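
Steps 2 and 3 above can be pictured with a small sketch. It groups change events by a date-based partition and splits each group into file-sized batches. This is only an illustration under stated assumptions: the Hive-style `year=/month=/day=` layout, the `part-00000.parquet` file naming, the event fields (`table`, `ts_ms`, `after`), and the helper names `partition_prefix` and `batch_by_partition` are all hypothetical and are not taken from Streamkap. Writing the actual Parquet bytes is left out.

```python
from collections import defaultdict
from datetime import datetime, timezone


def partition_prefix(table: str, event_time_ms: int) -> str:
    """Build a Hive-style, date-based key prefix for one event (assumed layout)."""
    ts = datetime.fromtimestamp(event_time_ms / 1000, tz=timezone.utc)
    return f"{table}/year={ts:%Y}/month={ts:%m}/day={ts:%d}"


def batch_by_partition(events, max_rows=2):
    """Group events by partition, then split each group into batches of at most max_rows rows.

    Returns a list of (object_key, rows) pairs, one pair per file that would be written.
    """
    groups = defaultdict(list)
    for event in events:
        prefix = partition_prefix(event["table"], event["ts_ms"])
        groups[prefix].append(event["after"])  # keep only the row state after the change

    batches = []
    for prefix, rows in sorted(groups.items()):
        for start in range(0, len(rows), max_rows):
            part = start // max_rows
            key = f"{prefix}/part-{part:05d}.parquet"  # hypothetical file naming
            batches.append((key, rows[start:start + max_rows]))
    return batches


# Hypothetical CDC events: three changes on one day, one on the next.
events = [
    {"table": "orders", "ts_ms": 1700000000000, "after": {"id": 1, "total": 9.5}},
    {"table": "orders", "ts_ms": 1700000001000, "after": {"id": 2, "total": 20.0}},
    {"table": "orders", "ts_ms": 1700000002000, "after": {"id": 3, "total": 7.25}},
    {"table": "orders", "ts_ms": 1700086400000, "after": {"id": 4, "total": 3.0}},
]

for key, rows in batch_by_partition(events, max_rows=2):
    print(key, len(rows))
```

Date-based prefixes of this kind let query engines such as Athena or Spark skip partitions that fall outside a query's time range, which is the usual reason to partition by time.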

Getting Started

  1. Configure your S3 bucket and IAM credentials
  2. Set up partitioning and file size preferences
  3. Select the tables you want to sync
  4. Start streaming data to Parquet files
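
For step 1, the IAM credentials need permission to write objects into the target bucket. The sketch below builds a minimal IAM policy document for that purpose. The bucket name, the key prefix, the helper name `s3_write_policy`, and the exact list of S3 actions are assumptions made for illustration; this page does not state which permissions the connector requires, so confirm them against the Streamkap setup documentation before use.

```python
import json


def s3_write_policy(bucket: str, prefix: str = "") -> dict:
    """Return an IAM policy document granting list access to a bucket
    and read/write access to objects under an optional key prefix."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Bucket-level actions apply to the bucket ARN itself.
                "Effect": "Allow",
                "Action": ["s3:ListBucket", "s3:GetBucketLocation"],
                "Resource": f"arn:aws:s3:::{bucket}",
            },
            {
                # Object-level actions apply to keys under the prefix.
                "Effect": "Allow",
                "Action": ["s3:PutObject", "s3:GetObject"],
                "Resource": f"arn:aws:s3:::{bucket}/{prefix}*",
            },
        ],
    }


# Hypothetical bucket and prefix names.
policy = s3_write_policy("my-example-bucket", "streamkap/")
print(json.dumps(policy, indent=2))
```

Scoping the object-level statement to a prefix keeps the credentials from touching unrelated data in the same bucket.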

Why Streamkap?

Reliable

Serverless platform providing enterprise reliability and scale.

Affordable

Pay only for the gigabytes you send into Streamkap.

Long-term

Retain data for as long as you need.

Flexible

Read once, write to many destinations.