The API that started the LLM revolution. GPT-4o, o1, embeddings, and DALL-E: the benchmark everything else is measured against.
- Strong on: Best-in-class models
- Addresses Weights & Biases's tradeoff (expensive for large teams)
- Higher overall score (10.0 vs 8.2 for Weights & Biases)