This repository contains an implementation of the Transformer model for machine translation, inspired by the "Attention is All You Need" paper by Vaswani et al. The model is built from scratch using Python and PyTorch.
- Implements the Transformer architecture with self-attention and multi-head attention mechanisms (a minimal sketch of the attention block follows this list).
- Supports both encoder and decoder stacks as described in the original paper.
- Customizable hyperparameters for model size, number of layers, and attention heads.
- Can be trained on a sample dataset for language translation tasks.
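For reference, the multi-head attention mechanism from the paper can be written compactly in PyTorch. The following is a minimal, self-contained sketch of the mechanism itself, not the repository's actual module; the class and parameter names here are illustrative:

```python
# Minimal sketch of scaled dot-product multi-head attention, following
# "Attention is All You Need". Names are illustrative, not the repo's API.
import math
import torch
import torch.nn as nn

class MultiHeadAttention(nn.Module):
    def __init__(self, d_model: int, num_heads: int):
        super().__init__()
        assert d_model % num_heads == 0, "d_model must divide evenly across heads"
        self.d_k = d_model // num_heads
        self.num_heads = num_heads
        # One projection each for queries, keys, values, plus the output projection.
        self.w_q = nn.Linear(d_model, d_model)
        self.w_k = nn.Linear(d_model, d_model)
        self.w_v = nn.Linear(d_model, d_model)
        self.w_o = nn.Linear(d_model, d_model)

    def forward(self, query, key, value, mask=None):
        batch_size = query.size(0)
        # Project and split into heads: (batch, heads, seq_len, d_k).
        q = self.w_q(query).view(batch_size, -1, self.num_heads, self.d_k).transpose(1, 2)
        k = self.w_k(key).view(batch_size, -1, self.num_heads, self.d_k).transpose(1, 2)
        v = self.w_v(value).view(batch_size, -1, self.num_heads, self.d_k).transpose(1, 2)
        # Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V.
        scores = torch.matmul(q, k.transpose(-2, -1)) / math.sqrt(self.d_k)
        if mask is not None:
            scores = scores.masked_fill(mask == 0, float("-inf"))
        attn = torch.softmax(scores, dim=-1)
        # Recombine the heads and apply the output projection.
        out = torch.matmul(attn, v).transpose(1, 2).contiguous()
        out = out.view(batch_size, -1, self.num_heads * self.d_k)
        return self.w_o(out)
```

Each head attends over a `d_k`-dimensional slice of the model dimension, which is why `d_model` must be divisible by the number of heads.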
- Python 3.8+
Install dependencies using:

```bash
pip install -r requirements.txt
```
- `transformer`: Core implementation of the Transformer model.
- `configs`: Configuration files for training.
- `train.py`: Script to train the model on a translation dataset (a hypothetical config-loading sketch follows this list).
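The YAML schema used by the configs is repository-specific; the following is only a rough sketch of how `train.py` might read one, assuming hypothetical keys such as `d_model`, `num_layers`, and `num_heads`:

```python
# Hypothetical sketch of loading a YAML training config; the actual keys
# and structure of the files in configs/ may differ.
import argparse
import yaml  # provided by the PyYAML package

parser = argparse.ArgumentParser()
parser.add_argument("--config", required=True, help="Path to a YAML config file")
args = parser.parse_args()

with open(args.config) as f:
    config = yaml.safe_load(f)

# Illustrative hyperparameters; the names are assumptions, not the repo's schema.
d_model = config.get("d_model", 512)
num_layers = config.get("num_layers", 6)
num_heads = config.get("num_heads", 8)
print(f"Building a {num_layers}-layer Transformer (d_model={d_model}, heads={num_heads})")
```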
- Train the model:

```bash
python train.py --config configs/en2it.yaml
```
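Conceptually, training a Transformer translator means teacher forcing: the decoder sees the target shifted right and is trained to predict the next token. Here is a minimal sketch of one such training step; `model`, `src`, `tgt`, and the pad token id are placeholder assumptions rather than the repository's actual objects:

```python
# Minimal sketch of one teacher-forced training step. `model`, `src`, `tgt`,
# and pad_id are placeholder assumptions, not the repo's actual objects.
import torch
import torch.nn as nn

def train_step(model, optimizer, src, tgt, pad_id=0):
    criterion = nn.CrossEntropyLoss(ignore_index=pad_id)  # ignore padding positions
    decoder_input = tgt[:, :-1]   # target shifted right
    labels = tgt[:, 1:]           # next-token labels
    logits = model(src, decoder_input)  # assumed shape: (batch, tgt_len, vocab_size)
    loss = criterion(logits.reshape(-1, logits.size(-1)), labels.reshape(-1))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```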
This implementation was inspired by tutorial videos and documentation available online.