Skip to content

Data filtering on biomedical topics #1

@tmabraham

Description

@tmabraham

Write a ligthweight script that, given a HuggingFace dataset like https://huggingface.co/datasets/open-thoughts/OpenThoughts2-1M or https://huggingface.co/datasets/GeneralReasoning/GeneralThought-430K, filters/tags only biomedically relevant samples

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions