A lightweight implementation of a GPT-like transformer model with 1.6M parameters (for now!!) using JAX and Flax's NNX module. The code demonstrates autoregressive text generation, multi-head attention, and a training loop with Optax, and it automatically shards training across multiple devices via positional sharding.
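The core of the model is causal multi-head self-attention. A minimal pure-JAX sketch of one attention block is below; the function name, weight shapes, and single-sequence layout are illustrative assumptions, not the repo's actual Flax NNX modules:

```python
import jax
import jax.numpy as jnp

def causal_self_attention(x, w_qkv, w_out, n_heads):
    """Sketch of multi-head causal self-attention for one sequence.

    x:     (T, C) token embeddings
    w_qkv: (C, 3C) fused query/key/value projection
    w_out: (C, C) output projection
    (Hypothetical shapes; the real model lives in train.py.)
    """
    T, C = x.shape
    head_dim = C // n_heads
    q, k, v = jnp.split(x @ w_qkv, 3, axis=-1)           # each (T, C)
    # Reshape to (n_heads, T, head_dim) so heads attend independently.
    q = q.reshape(T, n_heads, head_dim).transpose(1, 0, 2)
    k = k.reshape(T, n_heads, head_dim).transpose(1, 0, 2)
    v = v.reshape(T, n_heads, head_dim).transpose(1, 0, 2)
    scores = q @ k.transpose(0, 2, 1) / jnp.sqrt(head_dim)  # (H, T, T)
    # Causal mask: position t may only attend to positions <= t.
    mask = jnp.tril(jnp.ones((T, T), dtype=bool))
    scores = jnp.where(mask, scores, -jnp.inf)
    attn = jax.nn.softmax(scores, axis=-1)
    out = (attn @ v).transpose(1, 0, 2).reshape(T, C)    # merge heads
    return out @ w_out
```

The lower-triangular mask is what makes generation autoregressive: perturbing a later token cannot change the outputs at earlier positions.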
- Ensure JAX is installed (follow the official installation instructions for your hardware).
- Install dependencies:

  ```shell
  pip install flax optax
  ```

- Clone the repo, then run:

  ```shell
  python train.py --max_iters 200000 --lr 1e-3
  ```

Example 1: Training on the tiny-shakespeare dataset:
- Generated output:

  ```
  KING RIVH:
  It genton. I was the grief
  Forth the time of is offected: how God noug-leakes?
  Your wift quany at stand pass--
  Stoation incuttection to-danius shorate anginganeness,
  Thy switn's roveried skily before in grief
  Would thee stoble Rome of find,
  Mach thing bound you will all, and Clifford!

  FLORIZEL:
  Who should had thanks you
  Formit gaters thee gods, and swear be
  Intives that thither-buney, we heart this grule commends
  By the ritory striken time of their
  conselds, let where I will becalle
  ```
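Samples like the above come from a standard autoregressive loop: crop the context to the block size, run the model, sample the next token from the last position's logits, append, and repeat. A minimal sketch follows; the `apply_fn` signature and argument names are assumptions for illustration, not the repo's actual generation API:

```python
import jax
import jax.numpy as jnp

def generate(params, apply_fn, prompt, max_new_tokens, key, block_size=8):
    """Sketch of autoregressive sampling.

    apply_fn(params, tokens) -> logits of shape (T, vocab_size).
    (Hypothetical interface; see the repo's train.py for the real one.)
    """
    tokens = prompt
    for _ in range(max_new_tokens):
        context = tokens[-block_size:]                  # crop to context window
        logits = apply_fn(params, context)              # (T, vocab_size)
        key, subkey = jax.random.split(key)
        next_tok = jax.random.categorical(subkey, logits[-1])  # sample last position
        tokens = jnp.concatenate([tokens, next_tok[None]])
    return tokens
```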
Example 2: Training on a names dataset
- Names generated:

  ```
  breesmin
  kindley
  britlynn
  iassiqu
  floreca
  catle
  peava
  theania
  yaanira
  hanvaron
  neasea
  orrell
  cinjaya
  daraz
  quaniella
  blaver
  jushadomie
  treymeca
  shayana
  julene
  share
  istofano
  nariah
  mckinda
  iskia
  kashir
  arhiyah
  letena
  estina
  asamie
  azur
  aralie
  stara
  keiliy
  yyena
  josebe
  breya
  sinna
  seondro
  ramek
  zariah
  shakerra
  fairah
  demariy
  siara
  clanda
  saprion
  naimah
  shety
  marlon
  cailla
  ```
References:
- https://github.com/karpathy/ng-video-lecture
- Attention Is All You Need: https://arxiv.org/abs/1706.03762v7

