LLM & OpenAI API Pricing Calculator
Estimate & Compare LLM APIs Cost
Begin with a clear financial plan using our LLM API Cost Calculator. Designed to tackle the complexities of pricing for major APIs like OpenAI, Azure, and Anthropic Claude, our OpenAI API pricing calculator delivers precise cost estimates for GPT and ChatGPT APIs. Updated to reflect the latest rates as of February 2025. Get your accurate cost estimate now and step confidently into building your innovative AI product.
Streamlined Pricing for OpenAI API Services
Calculated by
Provider | Model Name | Context | Input/1M | Output/1M | Per Call | Total | Last Updated |
---|---|---|---|---|---|---|---|
Chat/Completion Models | |||||||
OpenAI | GPT-4o | 128K/16K | $2.5 | $10 | $0 | $0 | Aug 2024 |
OpenAI | GPT-4o mini | 128K/16K | $0.15 | $0.6 | $0 | $0 | Jul 2024 |
OpenAI | GPT-4o Mini Audio (Text) | 128K | $0.15 | $0.6 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Realtime (Text) | 128K | $0.6 | $2.4 | $0 | $0 | N/A |
OpenAI | o1 | 200K/100K | $15 | $60 | $0 | $0 | Dec 2024 |
OpenAI | o1-preview | 128K/32K | $15 | $60 | $0 | $0 | Sep 2024 |
OpenAI | o1-mini | 128K/65K | $3 | $12 | $0 | $0 | Sep 2024 |
OpenAI | GPT-4o Realtime (Text) | 128K/16K | $5 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4o Audio (Text) | 128K/16K | $2.5 | $10 | $0 | $0 | N/A |
OpenAI | GPT-4o Realtime (Audio) | 128K | $40 | $80 | $0 | $0 | N/A |
OpenAI | GPT-4o Audio (Audio) | 128K | $40 | $80 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Audio (Audio) | 128K | $10 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Realtime (Audio) | 128K | $10 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4 Turbo | 128K/4K | $10 | $30 | $0 | $0 | N/A |
OpenAI | GPT-3.5 Turbo | 16K/4K | $0.5 | $1.5 | $0 | $0 | Jan 2024 |
OpenAI | GPT-4 | 8K/8K | $30 | $60 | $0 | $0 | Jun 2023 |
Anthropic | Claude 3.5 Sonnet | 200K/8K | $3.00 | $15.00 | $0 | $0 | Oct 2024 |
Anthropic | Claude 3.5 Haiku | 200K/8K | $3.00 | $15.00 | $0 | $0 | Oct 2024 |
Meta | Llama 3.3 70b | 128K/2K | $0.23 | $0.4 | $0 | $0 | Dec 2024 |
Meta | Llama 3.1 405b | 128K/2K | $1.79 | $1.79 | $0 | $0 | Jul 2023 |
Meta | Llama 3.2 90b Vision-Instruct | 128K/2K | $0.35 | $0.40 | $0 | $0 | Sep 2024 |
Meta | Llama 3.1 70b | 128K/2K | $0.055 | $0.055 | $0 | $0 | Sep 2024 |
Meta | Llama 3 70b | 8K/2K | $0.59 | $0.79 | $0 | $0 | Apr 2024 |
Gemini 1.5 Pro | 128K | $1.25 | $5.00 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Pro | 2M | $2.50 | $10.00 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash | 128K | $0.075 | $0.30 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash | 1M | $0.15 | $0.60 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash-8B | 128K | $0.0375 | $0.15 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash-8B | 1M | $0.075 | $0.30 | $0 | $0 | Sep 2024 | |
Amazon | Amazon Nova Micro | 128K | $0.035 | $0.14 | $0 | $0 | Dec 2024 |
Amazon | Amazon Nova Lite | 300K | $0.06 | $0.24 | $0 | $0 | Dec 2024 |
Amazon | Amazon Nova Pro | 300K | $0.8 | $3.2 | $0 | $0 | Dec 2024 |
Cohere | Command | 4K | $10 | $20 | $0 | $0 | N/A |
Cohere | Command R | 128K/4K | $0.5 | $1.5 | $0 | $0 | Aug 2024 |
Cohere | Command R+ | 128K | $3 | $15 | $0 | $0 | Aug 2024 |
Mistral AI | Mistral NeMo | 128K | $0.15 | $0.15 | $0 | $0 | N/A |
Mistral AI (via Anyscale) | Mixtral 8x7B | 32K | $0.5 | $0.5 | $0 | $0 | Dec 2023 |
Mistral AI | Mistral Large 2 | 128K | $2 | $6 | $0 | $0 | Jul 2024 |
DeepSeek | DeepSeek-V3 | 128K/8K | $0.14 | $0.28 | $0 | $0 | Jul 2024 |
Fine-tuning models | |||||||
OpenAI | GPT-4o | 128K/16K | $3.75 | $15 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini | 128K/16K | $0.30 | $1.20 | $0 | $0 | N/A |
OpenAI | GPT-3.5 turbo | 4K | $3.00 | $6.00 | $0 | $0 | N/A |
Embedding models | |||||||
OpenAI | 3 Small | 0.02 | $0 | $0 | |||
OpenAI | 3 Large | $0.13 | $0 | $0 | |||
OpenAI | Ada v2 | $0.1 | $0 | $0 | |||
Gemini Text Embedding | 2K | $0.10 | $0 | $0 | |||
Cohere | Embed v3.0 | $0.10 | $0 | $0 | |||
Mistral AI | Mistral Embed | $0.10 | $0 | $0 |
Provider | Model Name | Context | Input/1M | Output/1M | Per Call | total | Last Updated |
---|---|---|---|---|---|---|---|
Chat/Completion Models | |||||||
OpenAI | GPT-4o | 128K/16K | $2.5 | $10 | $0 | $0 | Aug 2024 |
OpenAI | GPT-4o mini | 128K/16K | $0.15 | $0.6 | $0 | $0 | Jul 2024 |
OpenAI | GPT-4o Mini Audio (Text) | 128K | $0.15 | $0.6 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Realtime (Text) | 128K | $0.6 | $2.4 | $0 | $0 | N/A |
OpenAI | o1 | 200K/100K | $15 | $60 | $0 | $0 | Dec 2024 |
OpenAI | o1-preview | 128K/32K | $15 | $60 | $0 | $0 | Sep 2024 |
OpenAI | o1-mini | 128K/65K | $3 | $12 | $0 | $0 | Sep 2024 |
OpenAI | GPT-4o Realtime (Text) | 128K/16K | $5 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4o Audio (Text) | 128K/16K | $2.5 | $10 | $0 | $0 | N/A |
OpenAI | GPT-4o Realtime (Audio) | 128K | $40 | $80 | $0 | $0 | N/A |
OpenAI | GPT-4o Audio (Audio) | 128K | $40 | $80 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Audio (Audio) | 128K | $10 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Realtime (Audio) | 128K | $10 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4 Turbo | 128K/4K | $10 | $30 | $0 | $0 | N/A |
OpenAI | GPT-3.5 Turbo | 16K/4K | $0.5 | $1.5 | $0 | $0 | Jan 2024 |
OpenAI | GPT-4 | 8K/8K | $30 | $60 | $0 | $0 | Jun 2023 |
Anthropic | Claude 3.5 Sonnet | 200K/8K | $3.00 | $15.00 | $0 | $0 | Oct 2024 |
Anthropic | Claude 3.5 Haiku | 200K/8K | $3.00 | $15.00 | $0 | $0 | Oct 2024 |
Meta | Llama 3.3 70b | 128K/2K | $0.23 | $0.4 | $0 | $0 | Dec 2024 |
Meta | Llama 3.1 405b | 128K/2K | $1.79 | $1.79 | $0 | $0 | Jul 2023 |
Meta | Llama 3.2 90b Vision-Instruct | 128K/2K | $0.35 | $0.40 | $0 | $0 | Sep 2024 |
Meta | Llama 3.1 70b | 128K/2K | $0.055 | $0.055 | $0 | $0 | Sep 2024 |
Meta | Llama 3 70b | 8K/2K | $0.59 | $0.79 | $0 | $0 | Apr 2024 |
Gemini 1.5 Pro | 128K | $1.25 | $5.00 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Pro | 2M | $2.50 | $10.00 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash | 128K | $0.075 | $0.30 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash | 1M | $0.15 | $0.60 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash-8B | 128K | $0.0375 | $0.15 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash-8B | 1M | $0.075 | $0.30 | $0 | $0 | Sep 2024 | |
Amazon | Amazon Nova Micro | 128K | $0.035 | $0.14 | $0 | $0 | Dec 2024 |
Amazon | Amazon Nova Lite | 300K | $0.06 | $0.24 | $0 | $0 | Dec 2024 |
Amazon | Amazon Nova Pro | 300K | $0.8 | $3.2 | $0 | $0 | Dec 2024 |
Cohere | Command | 4K | $10 | $20 | $0 | $0 | N/A |
Cohere | Command R | 128K/4K | $0.5 | $1.5 | $0 | $0 | Aug 2024 |
Cohere | Command R+ | 128K | $3 | $15 | $0 | $0 | Aug 2024 |
Mistral AI | Mistral NeMo | 128K | $0.15 | $0.15 | $0 | $0 | N/A |
Mistral AI (via Anyscale) | Mixtral 8x7B | 32K | $0.5 | $0.5 | $0 | $0 | Dec 2023 |
Mistral AI | Mistral Large 2 | 128K | $2 | $6 | $0 | $0 | Jul 2024 |
DeepSeek | DeepSeek-V3 | 128K/8K | $0.14 | $0.28 | $0 | $0 | Jul 2024 |
Fine-tuning models | |||||||
OpenAI | GPT-4o | 128K/16K | $3.75 | $15 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini | 128K/16K | $0.30 | $1.20 | $0 | $0 | N/A |
OpenAI | GPT-3.5 turbo | 4K | $3.00 | $6.00 | $0 | $0 | N/A |
Embedding models | |||||||
OpenAI | 3 Small | 0.02 | $0 | $0 | |||
OpenAI | 3 Large | $0.13 | $0 | $0 | |||
OpenAI | Ada v2 | $0.1 | $0 | $0 | |||
Gemini Text Embedding | 2K | $0.10 | $0 | $0 | |||
Cohere | Embed v3.0 | $0.10 | $0 | $0 | |||
Mistral AI | Mistral Embed | $0.10 | $0 | $0 |
Provider | Model Name | Context | Input/1M | Output/1M | Per Call | total | Last Updated |
---|---|---|---|---|---|---|---|
Chat/Completion Models | |||||||
OpenAI | GPT-4o | 128K/16K | $2.5 | $10 | $0 | $0 | Aug 2024 |
OpenAI | GPT-4o mini | 128K/16K | $0.15 | $0.6 | $0 | $0 | Jul 2024 |
OpenAI | GPT-4o Mini Audio (Text) | 128K | $0.15 | $0.6 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Realtime (Text) | 128K | $0.6 | $2.4 | $0 | $0 | N/A |
OpenAI | o1 | 200K/100K | $15 | $60 | $0 | $0 | Dec 2024 |
OpenAI | o1-preview | 128K/32K | $15 | $60 | $0 | $0 | Sep 2024 |
OpenAI | o1-mini | 128K/65K | $3 | $12 | $0 | $0 | Sep 2024 |
OpenAI | GPT-4o Realtime (Text) | 128K/16K | $5 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4o Audio (Text) | 128K/16K | $2.5 | $10 | $0 | $0 | N/A |
OpenAI | GPT-4o Realtime (Audio) | 128K | $40 | $80 | $0 | $0 | N/A |
OpenAI | GPT-4o Audio (Audio) | 128K | $40 | $80 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Audio (Audio) | 128K | $10 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini Realtime (Audio) | 128K | $10 | $20 | $0 | $0 | N/A |
OpenAI | GPT-4 Turbo | 128K/4K | $10 | $30 | $0 | $0 | N/A |
OpenAI | GPT-3.5 Turbo | 16K/4K | $0.5 | $1.5 | $0 | $0 | Jan 2024 |
OpenAI | GPT-4 | 8K/8K | $30 | $60 | $0 | $0 | Jun 2023 |
Anthropic | Claude 3.5 Sonnet | 200K/8K | $3.00 | $15.00 | $0 | $0 | Oct 2024 |
Anthropic | Claude 3.5 Haiku | 200K/8K | $3.00 | $15.00 | $0 | $0 | Oct 2024 |
Meta | Llama 3.3 70b | 128K/2K | $0.23 | $0.4 | $0 | $0 | Dec 2024 |
Meta | Llama 3.1 405b | 128K/2K | $1.79 | $1.79 | $0 | $0 | Jul 2023 |
Meta | Llama 3.2 90b Vision-Instruct | 128K/2K | $0.35 | $0.40 | $0 | $0 | Sep 2024 |
Meta | Llama 3.1 70b | 128K/2K | $0.055 | $0.055 | $0 | $0 | Sep 2024 |
Meta | Llama 3 70b | 8K/2K | $0.59 | $0.79 | $0 | $0 | Apr 2024 |
Gemini 1.5 Pro | 128K | $1.25 | $5.00 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Pro | 2M | $2.50 | $10.00 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash | 128K | $0.075 | $0.30 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash | 1M | $0.15 | $0.60 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash-8B | 128K | $0.0375 | $0.15 | $0 | $0 | Sep 2024 | |
Gemini 1.5 Flash-8B | 1M | $0.075 | $0.30 | $0 | $0 | Sep 2024 | |
Amazon | Amazon Nova Micro | 128K | $0.035 | $0.14 | $0 | $0 | Dec 2024 |
Amazon | Amazon Nova Lite | 300K | $0.06 | $0.24 | $0 | $0 | Dec 2024 |
Amazon | Amazon Nova Pro | 300K | $0.8 | $3.2 | $0 | $0 | Dec 2024 |
Cohere | Command | 4K | $10 | $20 | $0 | $0 | N/A |
Cohere | Command R | 128K/4K | $0.5 | $1.5 | $0 | $0 | Aug 2024 |
Cohere | Command R+ | 128K | $3 | $15 | $0 | $0 | Aug 2024 |
Mistral AI | Mistral NeMo | 128K | $0.15 | $0.15 | $0 | $0 | N/A |
Mistral AI (via Anyscale) | Mixtral 8x7B | 32K | $0.5 | $0.5 | $0 | $0 | Dec 2023 |
Mistral AI | Mistral Large 2 | 128K | $2 | $6 | $0 | $0 | Jul 2024 |
DeepSeek | DeepSeek-V3 | 128K/8K | $0.14 | $0.28 | $0 | $0 | Jul 2024 |
Fine-tuning models | |||||||
OpenAI | GPT-4o | 128K/16K | $3.75 | $15 | $0 | $0 | N/A |
OpenAI | GPT-4o Mini | 128K/16K | $0.30 | $1.20 | $0 | $0 | N/A |
OpenAI | GPT-3.5 turbo | 4K | $3.00 | $6.00 | $0 | $0 | N/A |
Embedding models | |||||||
OpenAI | 3 Small | 0.02 | $0 | $0 | |||
OpenAI | 3 Large | $0.13 | $0 | $0 | |||
OpenAI | Ada v2 | $0.1 | $0 | $0 | |||
Gemini Text Embedding | 2K | $0.10 | $0 | $0 | |||
Cohere | Embed v3.0 | $0.10 | $0 | $0 | |||
Mistral AI | Mistral Embed | $0.10 | $0 | $0 |
Total Price: $0.00
Streamlined Pricing for
OpenAI API Services
Understanding OpenAI API Pricing
1. Tokens and Context Length Simplified
2. Model Choice and Usage
- OpenAI GPT-4: A leading-edge model, GPT-4 boasts extensive general knowledge and specialized expertise. Ideal for complex instructions and problem-solving, it offers precise outputs but at a higher cost. GPT-4 Turbo, it’s faster and more affordable variant, supports a substantial 128K context limit, enhancing its range of applications.
- OpenAI GPT-3.5 Turbo: Optimized for dialogue and conversational interfaces, GPT-3.5 Turbo is the go-to model for chatbot technology. It stands out for its speed and cost-effectiveness in text generation, making it a popular choice for real-time, interactive applications.
- Anthropic Claude 2: Known for its impressive 100K context length, Claude 2 excels in summarizing large documents and handling detailed Q&A sessions. While its extensive context capacity is a significant advantage, it comes at the cost of speed and affordability.
- Llama 2: Developed by Meta, this open-source model is akin to GPT-3.5 Turbo in performance. Notably cost-effective, it specializes in English text summarization and question answering. Although it’s limited to English, its affordability and versatility make it a strong contender in the AI landscape.
- Gemini Series by Google: This series includes Gemini Ultra, Pro, and Nano. Gemini Ultra rivals OpenAI’s GPT-4 in capabilities, while Gemini Pro aligns more with GPT-3.5 in performance. This range offers versatile, multimodal functionalities, catering to diverse AI needs.
- PaLM 2 by Google: PaLM 2 is distinguished by its advanced multilingual capabilities and reasoning prowess. Trained on a vast, diverse dataset, it excels in tasks requiring complex language understanding and translation, including coding, making it ideal for academic and technical applications.
- Mistral AI Models: Mistral AI offers accessible, open-source models like Mistral 7B and Mixtral, which provide rapid and cost-efficient solutions. These models compete with larger models like GPT-3.5 Turbo in performance, offering a viable alternative for budget-conscious AI applications.
Use the OpenAI token calculator for precise GPT API pricing.

FAQ’s