Description
Code of Conduct
- I agree to follow this project's Code of Conduct
Search before asking
- I have searched in the issues and found no similar issues.
Describe the proposal
Currently, remote shuffle storage only supports Hadoop-compatible filesystems. Object storage is also important and widely used in big data systems, but implementations differ. Some object stores do not support the list operation, or perform it poorly. Some require care in choosing file names so that objects are spread across more buckets. Some do not support the append operation. Because we currently store index and data separately, the lack of append support would produce many small index files, so we should consider merging index files and data files. For better performance, we should add object storage support that can adapt to these different systems.
https://docs.google.com/document/d/1E88wZA9Yhr-pGeUEfxo6uSgsIXxg_ivPYBNcTOeaaZA/edit
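To make the idea of merging index and data files concrete, here is a minimal sketch of one possible layout, not the design from the linked document. It assumes that all blocks of a file are known before upload, so the index can be written once after the data and located through a fixed-size footer. That needs neither append nor list. The class name `MergedShuffleFile`, the entry format, and the footer format are all hypothetical.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.util.Arrays;
import java.util.List;

/** Hypothetical layout: [block 0][block 1]...[index entries][footer]. */
final class MergedShuffleFile {
    // Assumed index entry: (offset: long, length: int) = 12 bytes.
    static final int ENTRY_SIZE = Long.BYTES + Integer.BYTES;
    // Assumed footer: (indexOffset: long, entryCount: int) = 12 bytes.
    static final int FOOTER_SIZE = Long.BYTES + Integer.BYTES;

    /** Serializes all blocks, then the index, then the footer, in one pass. */
    public static byte[] write(List<byte[]> blocks) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);

        // Data section: remember where each block starts.
        long[] offsets = new long[blocks.size()];
        for (int i = 0; i < blocks.size(); i++) {
            offsets[i] = out.size();
            out.write(blocks.get(i));
        }

        // Index section: one fixed-size entry per block.
        long indexOffset = out.size();
        for (int i = 0; i < blocks.size(); i++) {
            out.writeLong(offsets[i]);
            out.writeInt(blocks.get(i).length);
        }

        // Footer: lets a reader find the index from the end of the object.
        out.writeLong(indexOffset);
        out.writeInt(blocks.size());
        out.flush();
        return bytes.toByteArray();
    }

    /** Reads one block by position using only the footer and the index. */
    public static byte[] readBlock(byte[] file, int blockId) {
        ByteBuffer buf = ByteBuffer.wrap(file); // big-endian, same as DataOutputStream
        int footerPos = file.length - FOOTER_SIZE;
        long indexOffset = buf.getLong(footerPos);
        int count = buf.getInt(footerPos + Long.BYTES);
        if (blockId < 0 || blockId >= count) {
            throw new IndexOutOfBoundsException("no such block: " + blockId);
        }
        int entryPos = (int) indexOffset + blockId * ENTRY_SIZE;
        int offset = (int) buf.getLong(entryPos);
        int length = buf.getInt(entryPos + Long.BYTES);
        return Arrays.copyOfRange(file, offset, offset + length);
    }
}
```

The byte array returned by `write` would be uploaded as a single object. Against a real object store, `readBlock` would instead issue ranged reads: one for the footer, one for the index entry, one for the block. The sketch holds everything in memory and casts offsets to `int`, so it only illustrates the layout and is not suitable for large files as written.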
Task list
- [Improvement] Merge data file and index file #892
- [FEATURE] Expect to support the filesystem not implementing the append-mode. #391
- Documentation for object storage (an issue still needs to be created)
- [FEATURE] Add an interface for object storage #1133
Are you willing to submit PR?
- Yes I am willing to submit a PR!