1
0
Fork 0
mirror of https://github.com/karpathy/minGPT synced 2024-04-24 19:05:17 +02:00
minGPT/mingpt
Mishig Davaadorj 90420ee978
Use XOR operator `^` for checking assertion `type_given XOR params_given`
Use XOR operator `^` for checking assertion `type_given XOR params_given` in `GPT.__init__`
2022-07-28 22:33:51 +02:00
..
__init__.py first commit, able to multigpu train fp32 GPTs on math and character-level data, but have done barely any tuning. 2020-08-17 00:39:02 -07:00
bpe.py use a bit more extended example that has my last name too because nice to show how it breaks up into more tokens 2022-07-12 04:31:31 +00:00
model.py Use XOR operator `^` for checking assertion `type_given XOR params_given` 2022-07-28 22:33:51 +02:00
trainer.py Add optimizer to Trainer's self for callbacks. 2022-07-26 10:17:44 +02:00
utils.py refactor sequence generation into the model and match the huggingface/transformers api. touches everything but this makes a lot more sense to me aesthetically 2022-07-11 18:50:53 +00:00