Use Case

RAG (Retrieval Augmented Generation)

Calculate costs for vector search + generation

Workload Overview
Calculate costs for vector search + generation

This preset is optimized for RAG (Retrieval Augmented Generation) workloads. Different use cases have varying ratios of input to output tokens.

Input Heavy?

Depends on context size.

Output Heavy?

Depends on generation length.