Skip to content

Pull requests: mosaicml/streaming

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Bump databricks-sdk from 0.58.0 to 0.64.0 dependencies Pull requests that update a dependency file python Pull requests that update python code
#950 opened Aug 25, 2025 by dependabot bot Loading…
Update paramiko requirement from <4,>=2.11.0 to >=2.11.0,<5 dependencies Pull requests that update a dependency file python Pull requests that update python code
#947 opened Aug 4, 2025 by dependabot bot Loading…
Update huggingface-hub requirement from <0.34,>=0.23.4 to >=0.23.4,<0.35 dependencies Pull requests that update a dependency file python Pull requests that update python code
#945 opened Aug 4, 2025 by dependabot bot Loading…
removing redundent if statement
#944 opened Jul 22, 2025 by somay-jalan Loading…
5 of 8 tasks
Update datasets requirement from <4,>=2.4.0 to >=2.4.0,<5 dependencies Pull requests that update a dependency file python Pull requests that update python code
#942 opened Jul 14, 2025 by dependabot bot Loading…
Make SparkConnect the data source
#934 opened Jun 25, 2025 by XiaohanZhangCMU Loading…
8 tasks
Update numpy requirement from <2.2.0,>=1.21.5 to >=1.21.5,<2.3.0 dependencies Pull requests that update a dependency file python Pull requests that update python code
#896 opened Apr 7, 2025 by dependabot bot Loading…
Add upper bound for prefix_int
#823 opened Nov 5, 2024 by XiaohanZhangCMU Loading…
8 tasks
add jpeg quality option
#818 opened Oct 28, 2024 by cabreraalex Loading…
8 tasks
Refactor spanner to avoid creating large array
#773 opened Sep 3, 2024 by XiaohanZhangCMU Loading…
8 tasks done
Check file size within LocalUploader
#751 opened Aug 13, 2024 by XiaohanZhangCMU Loading…
8 tasks
Heterogeneous
#684 opened May 24, 2024 by XiaohanZhangCMU Draft
8 tasks
parallel merge index
#590 opened Feb 5, 2024 by XiaohanZhangCMU Loading…
8 tasks
Add varint to MDS
#574 opened Jan 23, 2024 by knighton Loading…
Add options to precompute the epoch
#569 opened Jan 20, 2024 by knighton Loading…
Nuke 1) torch dist, 2) shared memory, and 3) filelock
#556 opened Dec 30, 2023 by knighton Loading…
Add fine-grained timings to Writers
#555 opened Dec 30, 2023 by knighton Loading…
Let's blow away dist, and also shared memory
#552 opened Dec 26, 2023 by knighton Draft
2 of 3 tasks
Parquet streaming [WIP]
#538 opened Dec 15, 2023 by knighton Loading…
"Golden spike" PR
#488 opened Oct 28, 2023 by knighton Draft
Hf ingestion
#483 opened Oct 23, 2023 by XiaohanZhangCMU Loading…
8 tasks
Modify dataframe_to_mds to accept streaming DF
#478 opened Oct 20, 2023 by maddiedawson Loading…
8 tasks
Training on PQ shards
#443 opened Sep 22, 2023 by knighton Loading…
8 tasks
ProTip! Filter pull requests by the default branch with base:main.