Use Case
RAG (Retrieval Augmented Generation)
Calculate costs for vector search + generation
Workload Overview
Calculate costs for vector search + generation
This preset is optimized for RAG (Retrieval Augmented Generation) workloads. Different use cases have varying ratios of input to output tokens.
Input Heavy?
Depends on context size.
Output Heavy?
Depends on generation length.