Code

Open-source code and tools from AMD GenAI

CaptionQA

Utility-based benchmark for measuring how well image captions preserve image-level information

benchmark evaluation multimodal
APRIL

Active Partial Rollouts in RL to tame long-tail generation, achieving 44% throughput improvement

RL training optimization
Instella-VL

Open-source vision-language model combining CLIP encoder with language model for multimodal tasks

VLM multimodal vision-language
Instella-T2I

Text-to-image generation with 1D binary latents achieving 32× token reduction

text-to-image generation diffusion
Instella-Math

Math reasoning model trained with long chain-of-thought reinforcement learning

math reasoning RL
Instella

Fully open 3B language models with stellar performance, trained on AMD Instinct MI300X GPUs

language-model 3B pretraining
Agent-Laboratory

LLM agents as research assistants for automated scientific discovery

agents research automation