mirror of
https://github.com/karpathy/minGPT
synced 2024-06-10 00:36:07 +02:00
commit
c97efac9a9
|
@ -49,7 +49,7 @@ Code:
|
|||
|
||||
- [openai/gpt-2](https://github.com/openai/gpt-2) has the model but not the training code, and in TensorFlow
|
||||
- [openai/image-gpt](https://github.com/openai/image-gpt) has some more modern gpt-3 like modification in its code, good reference as well
|
||||
- huggingface/transformers has a [language-modeling example](https://github.com/huggingface/transformers/tree/master/examples/language-modeling). It is full-featured but as a result also somewhat challenging to trace. E.g. some large functions have as much as 90% unused code behind various branching statments that is unsued in the default setting of simple language modeling.
|
||||
- huggingface/transformers has a [language-modeling example](https://github.com/huggingface/transformers/tree/master/examples/language-modeling). It is full-featured but as a result also somewhat challenging to trace. E.g. some large functions have as much as 90% unused code behind various branching statements that is unused in the default setting of simple language modeling.
|
||||
|
||||
Papers + some implementation notes:
|
||||
|
||||
|
@ -96,4 +96,4 @@ Papers + some implementation notes:
|
|||
|
||||
### License
|
||||
|
||||
MIT
|
||||
MIT
|
||||
|
|
Loading…
Reference in New Issue