@@ -15,18 +15,109 @@ This is an updated version of ``IterableDataset`` in ``torch``.
15
15
.. autoclass :: IterDataPipe
16
16
17
17
18
- We have three types of Iterable DataPipes:
18
+ We have different types of Iterable DataPipes:
19
19
20
- 1. Load - help you interact with the file systems or online databases (e.g. FileOpener, GDriveReader)
20
+ 1. Archive - open and decompress archive files of different formats.
21
21
22
- 2. Transform - transform elements within DataPipes (e.g. batching, shuffling)
22
+ 2. Augmenting - augment your samples (e.g. adding index, or cycle through indefinitely).
23
23
24
- 3. Utility - utility functions (e.g. caching, CSV parsing, filtering)
24
+ 3. Combinatorial - perform combinatorial operations (e.g. sampling, shuffling).
25
25
26
- Load DataPipes
26
+ 4. Combining/Splitting - interact with multiple DataPipes by combining them or splitting one to many.
27
+
28
+ 5. Grouping - group samples within a DataPipe
29
+
30
+ 6. IO - interacting with the file systems or remote server (e.g. downloading, opening,
31
+ saving files, and listing the files in directories).
32
+
33
+ 7. Mapping - apply the a given function to each element in the DataPipe.
34
+
35
+ 8. Others - perform miscellaneous set of operations.
36
+
37
+ 9. Selecting - select specific samples within a DataPipe.
38
+
39
+ 10. Text - parse, read, and transform text files and data
40
+
41
+ Archive DataPipes
42
+ -------------------------
43
+
44
+ These DataPipes help opening and decompressing archive files of different formats.
45
+
46
+ .. autosummary ::
47
+ :nosignatures:
48
+ :toctree: generated/
49
+ :template: datapipe.rst
50
+
51
+ Extractor
52
+ RarArchiveLoader
53
+ TarArchiveReader
54
+ XzFileReader
55
+ ZipArchiveReader
56
+
57
+ Augmenting DataPipes
58
+ -----------------------------
59
+ These DataPipes help to augment your samples.
60
+
61
+ .. autosummary ::
62
+ :nosignatures:
63
+ :toctree: generated/
64
+ :template: datapipe.rst
65
+
66
+ Cycler
67
+ Enumerator
68
+ IndexAdder
69
+
70
+ Combinatorial DataPipes
71
+ -----------------------------
72
+ These DataPipes help to perform combinatorial operations.
73
+
74
+ .. autosummary ::
75
+ :nosignatures:
76
+ :toctree: generated/
77
+ :template: datapipe.rst
78
+
79
+ Sampler
80
+ Shuffler
81
+
82
+ Combining/Spliting DataPipes
83
+ -----------------------------
84
+ These tend to involve multiple DataPipes, combining them or splitting one to many.
85
+
86
+ .. autosummary ::
87
+ :nosignatures:
88
+ :toctree: generated/
89
+ :template: datapipe.rst
90
+
91
+ Concater
92
+ Demultiplexer
93
+ Forker
94
+ IterKeyZipper
95
+ MapKeyZipper
96
+ Multiplexer
97
+ SampleMultiplexer
98
+ UnZipper
99
+ Zipper
100
+
101
+ Grouping DataPipes
102
+ -----------------------------
103
+ These DataPipes have you group samples within a DataPipe.
104
+
105
+ .. autosummary ::
106
+ :nosignatures:
107
+ :toctree: generated/
108
+ :template: datapipe.rst
109
+
110
+ Batcher
111
+ BucketBatcher
112
+ Collator
113
+ Grouper
114
+ UnBatcher
115
+
116
+ IO DataPipes
27
117
-------------------------
28
118
29
- These DataPipes help you interact with the file systems or online databases (e.g. FileOpener, GDriveReader).
119
+ These DataPipes help interacting with the file systems or remote server (e.g. downloading, opening,
120
+ saving files, and listing the files in directories).
30
121
31
122
.. autosummary ::
32
123
:nosignatures:
@@ -42,73 +133,68 @@ These DataPipes help you interact with the file systems or online databases (e.g
42
133
HttpReader
43
134
IoPathFileLister
44
135
IoPathFileOpener
136
+ IoPathSaver
45
137
OnlineReader
46
138
ParquetDataFrameLoader
139
+ Saver
47
140
48
-
49
- Transform DataPipes
141
+ Mapping DataPipes
50
142
-------------------------
51
143
52
- These DataPipes transform elements within DataPipes (e.g. batching, shuffling) .
144
+ These DataPipes apply the a given function to each element in the DataPipe .
53
145
54
146
.. autosummary ::
55
147
:nosignatures:
56
148
:toctree: generated/
57
149
:template: datapipe.rst
58
150
59
- Batcher
60
- BucketBatcher
61
- Shuffler
151
+ FlatMapper
152
+ Mapper
62
153
63
- Utility DataPipes
154
+ Other DataPipes
64
155
-------------------------
65
-
66
- These DataPipes provide utility functions (e.g. caching, CSV parsing, filtering).
156
+ A miscellaneous set of DataPipes with different functionalities.
67
157
68
158
.. autosummary ::
69
159
:nosignatures:
70
160
:toctree: generated/
71
161
:template: datapipe.rst
72
162
73
- CSVDictParser
74
- CSVParser
75
- Collator
76
- Concater
77
- Cycler
78
163
DataFrameMaker
79
- Demultiplexer
80
164
EndOnDiskCacheHolder
81
- Enumerator
82
- Extractor
83
- Filter
84
- FlatMapper
85
- Forker
86
- Grouper
87
165
HashChecker
88
- Header
89
166
InMemoryCacheHolder
90
- IndexAdder
91
- IoPathSaver
92
- IterKeyZipper
93
167
IterableWrapper
168
+ OnDiskCacheHolder
169
+ ShardingFilter
170
+
171
+ Selecting DataPipes
172
+ -------------------------
173
+
174
+ These DataPipes helps you select specific samples within a DataPipe.
175
+
176
+ .. autosummary ::
177
+ :nosignatures:
178
+ :toctree: generated/
179
+ :template: datapipe.rst
180
+
181
+ Filter
182
+ Header
183
+
184
+ Text DataPipes
185
+ -----------------------------
186
+ These DataPipes help you parse, read, and transform text files and data.
187
+
188
+ .. autosummary ::
189
+ :nosignatures:
190
+ :toctree: generated/
191
+ :template: datapipe.rst
192
+
193
+ CSVDictParser
194
+ CSVParser
94
195
JsonParser
95
196
LineReader
96
- MapKeyZipper
97
- Mapper
98
- Multiplexer
99
- OnDiskCacheHolder
100
197
ParagraphAggregator
101
- RarArchiveLoader
102
198
RoutedDecoder
103
199
Rows2Columnar
104
- SampleMultiplexer
105
- Sampler
106
- Saver
107
- ShardingFilter
108
200
StreamReader
109
- TarArchiveReader
110
- UnBatcher
111
- UnZipper
112
- XzFileReader
113
- ZipArchiveReader
114
- Zipper
0 commit comments