mirror of https://github.com/karpathy/minGPT synced 2024-06-10 00:36:07 +02:00

Merge pull request #6 from shivamtawari/patch-1

Update README.md
Andrej 2020-08-19 00:17:36 -07:00 committed by GitHub
commit c97efac9a9


@@ -49,7 +49,7 @@ Code:
- [openai/gpt-2](https://github.com/openai/gpt-2) has the model but not the training code, and in TensorFlow
- [openai/image-gpt](https://github.com/openai/image-gpt) has some more modern gpt-3 like modification in its code, good reference as well
-- huggingface/transformers has a [language-modeling example](https://github.com/huggingface/transformers/tree/master/examples/language-modeling). It is full-featured but as a result also somewhat challenging to trace. E.g. some large functions have as much as 90% unused code behind various branching statments that is unsued in the default setting of simple language modeling.
+- huggingface/transformers has a [language-modeling example](https://github.com/huggingface/transformers/tree/master/examples/language-modeling). It is full-featured but as a result also somewhat challenging to trace. E.g. some large functions have as much as 90% unused code behind various branching statements that is unused in the default setting of simple language modeling.
Papers + some implementation notes:
@@ -96,4 +96,4 @@ Papers + some implementation notes:
### License
-MIT
+MIT