You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: examples/data_loss_prevention/README.md
+7-1Lines changed: 7 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -104,7 +104,7 @@ Below is a visualization of the pipeline showing all stages and data flow:
104
104
105
105
### Installing GliNER and dependencies
106
106
107
-
Prior to running the DLP pipeline, additional requirements must be installed in to your Conda environment.
107
+
Prior to running the DLP pipeline, additional requirements must be installed in to your Conda environment. This will download and install additional third-party open source software projects. Review the license terms of these open source projects before use.
The model being used is [gretelai/gretel-gliner-bi-small-v1.0](https://huggingface.co/gretelai/gretel-gliner-bi-small-v1.0) which is released under the [Apache License 2.0](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/apache-2.0.md).
123
+
122
124
Users who are using the pre-built Morpheus Docker container should skip this step. Downloading the model locally offers a slight performance benefit, otherwise the model is fetched on-demand, then on subsequent runs a cached copy of the model is used.
123
125
124
126
The model is stored in the Morpheus repository using [Git Large File Storage (LFS)](https://git-lfs.github.com/). Only those files which are strictly needed to run Morpheus are downloaded by default when the repository is cloned. The model can be downloaded using the `fetch_data.py` script.
@@ -128,6 +130,7 @@ git lfs install
128
130
./scripts/fetch_data.py fetch models
129
131
```
130
132
133
+
131
134
### Build the Example
132
135
133
136
#### Building the example by itself
@@ -219,6 +222,9 @@ Options:
219
222
--help Show this message and exit.
220
223
```
221
224
225
+
226
+
> **Note**: By default when the `--input_file` and `--dataset` flags are not set, the [gretelai/gretel-pii-masking-en-v1](https://huggingface.co/datasets/gretelai/gretel-pii-masking-en-v1) dataset will be fetched at runtime. This dataset is released under the [Apache License 2.0](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/apache-2.0.md).
227
+
222
228
To launch the configured Morpheus pipeline with the default arguments, run the following:
0 commit comments