
nejckorasa / s3-stream-unzip


Lightweight Java library to manage unzipping of large files and data in AWS S3 without knowing the size beforehand and without keeping it all in memory or writing to disk.

Home Page: https://nejckorasa.github.io/posts/s3-unzip/

License: MIT License

Java 100.00%
aws-s3 java s3 s3-bucket unzip unzipping-files decompression

s3-stream-unzip's People

Contributors

nejckorasa


s3-stream-unzip's Issues

pipe to unzip to s3?

What if you have the zip locally (or at least a stream for it) and want to unzip it into an S3 bucket? How would this be done?
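
One possible shape for this, sketched with only the JDK's java.util.zip classes: read the local stream with ZipInputStream and hand each file entry to an upload hook. The names LocalZipToSink, ObjectSink and unzipTo are hypothetical and are not part of s3-stream-unzip; a real sink would wrap an S3 (multipart) upload.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

class LocalZipToSink {

    /** Hypothetical upload hook; a real implementation would wrap an S3 upload. */
    interface ObjectSink {
        // Must read the content up to end-of-stream and must NOT close it.
        void upload(String key, InputStream content) throws IOException;
    }

    /** Streams every file entry of the zip to the sink and returns the keys written. */
    static List<String> unzipTo(InputStream zipSource, String keyPrefix, ObjectSink sink)
            throws IOException {
        List<String> keys = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(zipSource)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (entry.isDirectory()) {
                    continue; // directory markers carry no data
                }
                String key = keyPrefix + entry.getName();
                // ZipInputStream reports end-of-stream at the end of the current
                // entry, so the sink only ever sees this entry's bytes.
                sink.upload(key, zip);
                keys.add(key);
            }
        }
        return keys;
    }
}
```

Nothing is buffered beyond what the sink itself holds, so the archive size does not need to be known up front.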

You cannot unzip an archive if it contains folders with files

When unzipping an archive that contains folders with files, or subfolders with files, I encounter the following error:

Failed initialising multipart upload with uploadId null
Aborting [S3MultipartUpload uploading to <folder>/, with uploadId null due to error: {}
com.amazonaws.services.s3.model.AmazonS3Exception: Object name contains unsupported characters. (Service: Amazon S3; Status Code: 400)

Can you please tell me if the library can unzip the archive if it contains folders?
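
The log above shows an upload to a key ending in "/", which suggests (this is an assumption, not a confirmed diagnosis) that a directory entry of the archive was passed to the uploader. Zip archives may store folders as separate entries with no content; the JDK marks them via ZipEntry.isDirectory(). A minimal sketch of filtering them out, with the hypothetical names ZipEntryFilter and fileEntryNames:

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

class ZipEntryFilter {

    /** Returns the names of entries that carry file data, skipping directory markers. */
    static List<String> fileEntryNames(InputStream zipSource) throws IOException {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(zipSource)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                // isDirectory() is true when the entry name ends with "/".
                if (!entry.isDirectory()) {
                    names.add(entry.getName());
                }
            }
        }
        return names;
    }
}
```

Files inside folders keep their full path as the entry name (for example "folder/file.txt"), so they can still be uploaded under the matching key.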

Add SplitUnzipStrategy

Add a strategy that allows larger files to be split into multiple shards/files that are uploaded to S3.

  • it should handle possible headers (e.g. csv files)
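
To illustrate the header requirement, here is a rough sketch of the splitting step alone: every shard repeats the first line so that each uploaded CSV stays self-describing. CsvShardSplitter and split are hypothetical names, and the in-memory list is a simplification; an actual strategy would presumably work on the entry stream and upload each shard as it fills.

```java
import java.util.ArrayList;
import java.util.List;

class CsvShardSplitter {

    /**
     * Splits lines into shards holding at most maxRowsPerShard data rows each.
     * The first line is treated as the header and repeated at the top of every shard.
     * An input with no data rows yields no shards.
     */
    static List<List<String>> split(List<String> lines, int maxRowsPerShard) {
        if (maxRowsPerShard < 1) {
            throw new IllegalArgumentException("maxRowsPerShard must be at least 1");
        }
        List<List<String>> shards = new ArrayList<>();
        if (lines.isEmpty()) {
            return shards;
        }
        String header = lines.get(0);
        List<String> current = null;
        for (String row : lines.subList(1, lines.size())) {
            // Start a new shard when there is none yet or the current one is full
            // (size counts the header line, hence the + 1).
            if (current == null || current.size() == maxRowsPerShard + 1) {
                current = new ArrayList<>();
                current.add(header);
                shards.add(current);
            }
            current.add(row);
        }
        return shards;
    }
}
```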

Compatibility Issue between s3-stream-unzip and MinIO

Description

I am encountering a compatibility issue between the s3-stream-unzip library and the latest version of MinIO (dated 9/19/2023). When attempting to use s3-stream-unzip with the latest version of MinIO,

  • MinIO version (latest): 9/19/2023
  • MinIO version (working): 1/4/2023 (compatible)

What happens if submitting a task fails with an exception due to bounded queues and maximumPoolSize?

Hi
I was trying to port the code to C#. While prototyping, I came across a situation where submitting a task fails because of bounded queues and maximumPoolSize. It looks like this exception case is not handled: the ExecutorService can throw RejectedExecutionException.
Corresponding code

Suggestion: you could use the AWS transfer utility, which performs a multipart upload of a given stream under the hood.

My minimal C# implementation of the same is here:

https://github.com/madhub/S3StreamUnzip/blob/main/README.md
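
The rejection described in this issue can be reproduced with the JDK alone. The sketch below uses a hypothetical class, BoundedExecutorDemo, and is not taken from the library's code. It saturates a ThreadPoolExecutor that has one worker and a queue of capacity one: with the default AbortPolicy the third submit throws RejectedExecutionException, while with ThreadPoolExecutor.CallerRunsPolicy the submitting thread runs the overflow task itself, which is one common way to apply back-pressure.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicReference;

class BoundedExecutorDemo {

    /** Task that blocks until the latch is released, keeping the pool saturated. */
    private static Runnable blockedOn(CountDownLatch release) {
        return () -> {
            try {
                release.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        };
    }

    /** Returns true if submitting to a saturated pool was rejected (default AbortPolicy). */
    static boolean defaultPolicyRejects() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0L, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1));
        CountDownLatch release = new CountDownLatch(1);
        try {
            pool.submit(blockedOn(release)); // occupies the single worker
            pool.submit(blockedOn(release)); // fills the queue
            pool.submit(blockedOn(release)); // no worker, no queue slot: rejected
            return false;
        } catch (RejectedExecutionException e) {
            return true;
        } finally {
            release.countDown();
            pool.shutdown();
        }
    }

    /** Returns true if, under CallerRunsPolicy, the overflow task ran on the submitting thread. */
    static boolean callerRunsOverflow() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 1, 0L, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1),
                new ThreadPoolExecutor.CallerRunsPolicy());
        CountDownLatch release = new CountDownLatch(1);
        AtomicReference<Thread> ranOn = new AtomicReference<>();
        try {
            pool.submit(blockedOn(release));
            pool.submit(blockedOn(release));
            // Pool is saturated, so the policy runs this task in the caller.
            pool.submit(() -> ranOn.set(Thread.currentThread()));
            return ranOn.get() == Thread.currentThread();
        } finally {
            release.countDown();
            pool.shutdown();
        }
    }
}
```

Whether caller-runs is appropriate depends on the workload: it slows the producer down instead of failing, which may or may not be acceptable for the thread that is reading the zip stream.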
