Automatic Label Error Correction Without Human Labor

Doc

https://guotong1988.github.io/core_research/2024/02/01/auto-re-label/

Run

Step-1, Train the model on origin training dataset, train.py

Step-2, Predict the training/dev datasets, predict.py

Step-3, Prepare the candidate training datasets, get_dataset_list.py

Step-4, Find the best dataset by dev accuracy, explore_train.py

Requirement

transformers 4.38.2 or 4.26.1

torch 2.2.1 or 1.11.0

scikit-learn 1.3.2

datasets 2.18.0

accelerate 0.27.2

Experiment Results

Related Work

Label Error Correction With Human Labor: The Re-Label Method For Data-Centric Machine Learning

Controllable Label Error Fixing: Re-Label By Data Pattern For Controllable Deep Learning

Apply To LLMs: Simple Self-Eval-Drop for Leveraging LLMs, Drop Noise For Cleaning LLMs Data

More Info

The methods proposed in this project (and its related works) can be applied to all manually annotated (or dataset by LLMs) machine learning / deep learning tasks.

Not only NLP tasks, but can also be efficiently extended to CV(computer vision) tasks, ASR(speech recognition) tasks, TTS(text-to-speech) tasks, and more.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
common		common
data		data
LICENSE		LICENSE
README.md		README.md
explore_train.py		explore_train.py
get_dataset_list.py		get_dataset_list.py
predict.py		predict.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Automatic Label Error Correction Without Human Labor

Doc

Run

Requirement

Experiment Results

Related Work

More Info

About

Uh oh!

Releases

Packages

Languages

License

guotong1988/Automatic-Label-Error-Correction

Folders and files

Latest commit

History

Repository files navigation

Automatic Label Error Correction Without Human Labor

Doc

Run

Requirement

Experiment Results

Related Work

More Info

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages