Datasets

Open-source datasets from AMD GenAI on Hugging Face

CaptionQA

Utility-based benchmark with 33K+ questions across 4 domains for evaluating caption quality

benchmark multimodal captions
SAND-Post-Training-Dataset

High-quality synthetic reasoning dataset with 27K math and science problems for fine-tuning LLMs

math science reasoning synthetic
Instella-Long

Long-context training data for Instella models supporting 128K context length

long-context 128K training
SAND-MATH

Synthetic math dataset for training reasoning models on AMD GPUs

math reasoning synthetic
Instella-GSM8K-synthetic

Synthetic GSM8K-style math problems for Instella model training

math GSM8K synthetic
TTT-Bench

Benchmark for evaluating LLM reasoning with 412 novel Tic-Tac-Toe-style game questions across 4 game types

benchmark reasoning games