whoisandy/k2s3

k2s3

Kafka to S3 Streamer

Stream JSON messages from Kafka to partitioned Parquet files stored on S3

TODO:

  • Add Kafka consumer logic
  • Expose partition columns as a CLI option
  • Ability to enable/disable compaction
  • Ability to alter table with new partitions dynamically
  • Ability to include/exclude fields from JSON
  • Convert batches to partitioned Parquet tables
  • Add/update Kafka offset checkpoint files alongside the partitioned Parquet files
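
Several of the items above (partition columns, field include/exclude, batch conversion, offset checkpoints) can be illustrated with a minimal sketch. This is not code from this repository: the function names, the `(kafka_partition, offset, payload)` tuple shape, and the Hive-style `col=value` path layout are all assumptions made for illustration, and the actual Parquet write and S3 upload are left out. It uses only the Python standard library.

```python
import json
from collections import defaultdict


def filter_fields(record, include=None, exclude=None):
    """Keep only `include` keys (if given), then drop `exclude` keys."""
    if include is not None:
        record = {k: v for k, v in record.items() if k in include}
    if exclude:
        record = {k: v for k, v in record.items() if k not in exclude}
    return record


def partition_path(record, partition_cols):
    """Build a Hive-style path such as 'country=US/day=2020-01-01'.

    Raises KeyError if a partition column is missing from the record.
    """
    return "/".join(f"{col}={record[col]}" for col in partition_cols)


def group_batch(messages, partition_cols, include=None, exclude=None):
    """Group a batch of (kafka_partition, offset, json_bytes) tuples.

    Returns (groups, checkpoint): rows keyed by partition path, and the
    next offset to read for each Kafka partition.
    """
    groups = defaultdict(list)
    checkpoint = {}
    for kafka_partition, offset, payload in messages:
        record = json.loads(payload)
        # Compute the path before filtering so partition columns are
        # still available even if they are excluded from the rows.
        path = partition_path(record, partition_cols)
        groups[path].append(filter_fields(record, include, exclude))
        # Store the next offset to consume (last processed offset + 1).
        checkpoint[kafka_partition] = max(
            checkpoint.get(kafka_partition, 0), offset + 1
        )
    return dict(groups), checkpoint


# Hypothetical batch: two Kafka partitions, one partition column.
batch = [
    (0, 41, b'{"country": "US", "user": "a", "debug": 1}'),
    (0, 42, b'{"country": "DE", "user": "b", "debug": 0}'),
    (1, 7, b'{"country": "US", "user": "c", "debug": 1}'),
]
groups, checkpoint = group_batch(batch, ["country"], exclude={"debug"})
```

Each key in `groups` would map to one Parquet file (or file set) under the matching S3 prefix. Writing `checkpoint` only after those files are uploaded means a restart resumes from offsets whose data is already stored.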