Skip to content Skip to footer

HuggingFace launches FineWeb: A Novel Large-Scale (15-Trillion Tokens, 44TB Disk Space) Dataset designed for LLM Pretraining.

Leave a comment

0.0/5