Granite 3.3 8B Instruct
Overview
Granite 3.3 models feature enhanced reasoning capabilities and support for Fill-in-the-Middle (FIM) code completion. They are built on a foundation of open-source instruction datasets with permissive licenses, alongside internally curated synthetic datasets tailored for long-context problem-solving. These models preserve the key strengths of previous Granite versions, including support for a 128K context length, strong performance in retrieval-augmented generation (RAG) and function calling, and controls for response length and originality. Granite 3.3 also delivers competitive results across general, enterprise, and safety benchmarks. Released as open source, the models are available under the Apache 2.0 license.
Granite 3.3 8B Instruct was released on April 16, 2025. API access is available through Replicate.
Benchmarks
Granite 3.3 8B Instruct Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Granite 3.3 8B Instruct across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input Modalities | Output Modalities |
|---|---|---|---|---|---|---|---|---|---|
| Replicate | $0.50 | $0.50 | 128.0K | 8.2K | 0.3 | 50.0 c/s | — | Text Image Audio Video | Text Image Audio Video |
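As a rough illustration of how the per-token prices translate into request cost, the sketch below uses the Replicate figures from the table. It assumes that "$/M" means US dollars per million tokens and that billing is strictly linear with no minimum charge; neither is stated on this page. The function name `estimate_cost` is hypothetical and not part of any provider SDK.

```python
# Prices copied from the Replicate row of the pricing table.
# Assumption: "$/M" means USD per one million tokens.
INPUT_PRICE_PER_M = 0.50
OUTPUT_PRICE_PER_M = 0.50


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear per-token billing with no minimum fee.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token completion.
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

Because the input and output prices are equal here, the cost depends only on the total token count. Actual invoices may differ if the provider rounds or bills on another basis.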
API Access
API access for Granite 3.3 8B Instruct will be available soon through our gateway.