What's in the RedPajama-Data-1T LLM training set

$ 29.50

4.7
(376)
In stock
Description

RedPajama is “a project to create leading open-source models, starts by reproducing LLaMA training dataset of over 1.2 trillion tokens”. It’s a collaboration between Together, Ontocord.ai, ETH DS3Lab, Stanford CRFM, …

The Latest Open Source LLMs and Datasets

Bringing LLM Fine-Tuning and RLHF to Everyone

Finetuning an LLM: RLHF and alternatives (Part I)

From ChatGPT to LLaMA to RedPajama: I'm Switching My Interest to

Data analysis with SQLite and Python - Tutorial

Data analysis with SQLite and Python - Tutorial

Web LLM runs the vicuna-7b Large Language Model entirely in your

65-Billion-Parameter Large Model Pretraining Accelerated by 38

RedPajama Completes First Step Towards an Open-Source Model

Web LLM runs the vicuna-7b Large Language Model entirely in your