Gemini 3.5 Flash

vgemini-3.5-flash

Google

Modelworkhorsegaagenticfast
89
Strong
About This Model

Google's GA 'frontier workhorse' launched at I/O 2026. Beats Gemini 3.1 Pro on agentic and coding suites (76.2% Terminal-Bench 2.1, 83.6% MCP Atlas) at roughly 4x the speed, with 1M token context. Pricier than past Flash tiers at $1.50/$9.00 per 1M.

Last Evaluated: June 10, 2026
Official Website

Trust Vector Analysis

Dimension Breakdown

🚀Performance & Reliability
+

Unusual positioning: a Flash-tier model that beats the flagship 3.1 Pro on agentic/coding suites at ~4x speed. Benchmark figures are official claims from launch (2026-05-19); third-party replication still maturing.

task accuracy code

Official launch benchmarks for agentic coding; vendor-reported, pending broad third-party replication

Evidence
Terminal-Bench 2.176.2% on terminal/command-line agentic tasks (official claim, beats Gemini 3.1 Pro)
MCP Atlas83.6% multi-tool orchestration (official claim, above Gemini 3.1 Pro's 78.2%)
highVerified: 2026-06-10
task accuracy reasoning

Official agentic and multimodal reasoning benchmarks from launch materials

Evidence
Finance Agent v257.9% on multi-step financial agent tasks (official claim)
CharXiv84.2% on chart reasoning (official claim)
mediumVerified: 2026-06-10
task accuracy general

Cross-benchmark comparison against Gemini 3.1 Pro from official launch claims

Evidence
Google I/O 2026 LaunchPositioned as 'frontier workhorse' matching or beating prior Pro tier on agentic suites
mediumVerified: 2026-06-10
output consistency

Consistency assessment based on GA status and vendor throughput claims

Evidence
Google DeepMind Models PageGA status with ~4x speed of Gemini 3.1 Pro at consistent quality
mediumVerified: 2026-06-10
latency p50

Relative speed claims from official materials; absolute latency varies by workload

Evidence
Google launch materials~4x faster than Gemini 3.1 Pro on agentic workloads
mediumVerified: 2026-06-10
context window

Official specification from provider documentation

Evidence
Gemini API Changelog1,048,576 input tokens, 64K output tokens
highVerified: 2026-06-10
uptime

Historical uptime data from official status page

Evidence
Google Cloud Status99.9% uptime (last 90 days, Vertex AI)
highVerified: 2026-06-10
🛡️Security
+

Standard Gemini 3.x-family security posture on Google Cloud infrastructure. High-speed agentic use increases the importance of downstream tool sandboxing.

prompt injection resistance

OWASP LLM01 testing and vendor documentation review

Evidence
Google AI SafetyGemini 3.x-generation prompt injection defenses
mediumVerified: 2026-06-10
jailbreak resistance

Adversarial prompt testing

Evidence
Google Safety TestingAdversarial robustness consistent with Gemini 3.x family
mediumVerified: 2026-06-10
data leakage prevention

Privacy policy and API terms review

Evidence
Gemini API TermsPaid-tier API data not used for training
mediumVerified: 2026-06-10
output safety

Safety filter testing across content categories

Evidence
Google Safety FiltersConfigurable multi-category safety filters
highVerified: 2026-06-10
api security

API security feature review

Evidence
Google Cloud SecurityGoogle Cloud security standards, Vertex AI IAM
highVerified: 2026-06-10
🔒Privacy & Compliance
+

Same Google Cloud compliance envelope as the Pro tier: SOC/ISO certifications, GDPR, HIPAA via Google Cloud, EU residency options.

data residency

Cloud infrastructure documentation review

Evidence
Google Cloud LocationsRegional deployment options via Vertex AI
highVerified: 2026-06-10
training data optout

Terms of service review

Evidence
Gemini API TermsPaid API data not used for training
highVerified: 2026-06-10
data retention

Data retention policy review

Evidence
Google Cloud Service TermsEnterprise zero-retention configurations available
mediumVerified: 2026-06-10
pii handling

Data protection capability review

Evidence
Google AI DocumentationCustomer responsible for PII redaction; Cloud DLP integration available
mediumVerified: 2026-06-10
compliance certifications

Certification verification

Evidence
Google Cloud ComplianceSOC 1/2/3, ISO 27001/27017/27018, GDPR, HIPAA (via Google Cloud)
highVerified: 2026-06-10
zero data retention

Enterprise feature review

Evidence
Vertex AI Data GovernanceZero-retention configuration available for enterprise customers
mediumVerified: 2026-06-10
👁️Trust & Transparency
+

Solid documentation at launch, but the model is only ~3 weeks GA; most benchmark figures remain vendor-reported and independent verification is still accumulating.

explainability

Reasoning transparency evaluation

Evidence
Gemini API DocumentationThinking traces available, though shallower than Pro-tier extended reasoning
mediumVerified: 2026-06-10
hallucination rate

Benchmark-derived grounding assessment; vendor claims pending independent replication

Evidence
CharXiv (chart grounding)84.2% chart reasoning suggests solid grounding; speed-optimized tiers historically hallucinate slightly more
mediumVerified: 2026-06-10
bias fairness

Bias benchmark evaluation and policy review

Evidence
Google AI PrinciplesRegular bias testing and mitigation per AI Principles
mediumVerified: 2026-06-10
uncertainty quantification

Qualitative assessment; limited data given three weeks since GA

Evidence
Model BehaviorLimited independent assessment so far; recent GA release
lowVerified: 2026-06-10
model card quality

Documentation completeness review

Evidence
Gemini DocumentationLaunch documentation with benchmarks, specs, and pricing
highVerified: 2026-06-10
training data transparency

Public disclosure review

Evidence
Google AI BlogGeneral training description; detailed sources not disclosed
mediumVerified: 2026-06-10
guardrails

Safety mechanism analysis

Evidence
Safety SettingsConfigurable multi-category safety guardrails
highVerified: 2026-06-10
⚙️Operational Excellence
+

Full Google Cloud operational stack from day one. Pricing ($1.50/$9.00, cached input $0.15) is notably higher than past Flash tiers, narrowing the cost gap to Pro models.

api design quality

API design and feature completeness review

Evidence
Gemini APIRESTful API with streaming, function calling, multimodal, context caching
highVerified: 2026-06-10
sdk quality

SDK quality and maintenance assessment

Evidence
Google Gen AI SDKsUnified Gen AI SDKs across Python, Node.js, Go, Java
highVerified: 2026-06-10
versioning policy

Versioning and changelog review

Evidence
Gemini API ChangelogGA at I/O 2026 with documented model lifecycle
highVerified: 2026-06-10
monitoring observability

Observability tooling review

Evidence
Google Cloud ConsoleComprehensive Cloud Console and Vertex AI monitoring
highVerified: 2026-06-10
support quality

Support channel assessment

Evidence
Google Cloud SupportEnterprise support tiers with SLAs
highVerified: 2026-06-10
ecosystem maturity

Ecosystem and integration analysis

Evidence
Google AI EcosystemDay-one availability across AI Studio, Vertex AI, and Gemini app
highVerified: 2026-06-10
license terms

License terms review

Evidence
Google Cloud TermsStandard commercial terms; enterprise agreements available
highVerified: 2026-06-10
Strengths
  • +Beats Gemini 3.1 Pro on agentic/coding suites: 76.2% Terminal-Bench 2.1, 83.6% MCP Atlas
  • +~4x faster than Gemini 3.1 Pro — strong agent-loop economics
  • +1,048,576 token input context with 64K output
  • +Context caching at $0.15 per 1M cuts repeated-prefix costs dramatically
  • +GA from day one across AI Studio, Vertex AI, and Gemini app
  • +Full Google Cloud compliance envelope (SOC/ISO, GDPR, HIPAA via GCP)
Limitations
  • !Pricier than past Flash tiers ($1.50/$9.00 vs historical sub-dollar Flash pricing)
  • !Benchmark figures are official claims; independent replication still accumulating (~3 weeks since GA)
  • !Deepest reasoning tasks still favor Pro-tier models
  • !Only ~3 weeks of production track record
  • !Training data transparency limited (industry standard)
Metadata
pricing
input: $1.50 per 1M tokens
output: $9.00 per 1M tokens
notes: Cached input $0.15 per 1M. Pricier than past Flash tiers, reflecting 'frontier workhorse' positioning.
last verified: 2026-06-10
context window: 1048576
max output: 65536
languages
0: English
1: 100+ languages
modalities
0: text
1: vision
2: audio
3: video
api endpoint: https://generativelanguage.googleapis.com/v1beta/models
open source: false
architecture: Speed-optimized multimodal transformer with thinking support
parameters: Not disclosed
knowledge cutoff: Early 2026 (not officially confirmed)
release date: 2026-05-19

Use Case Ratings

code generation

76.2% Terminal-Bench 2.1 and 83.6% MCP Atlas — beats Gemini 3.1 Pro on agentic coding at ~4x speed. Excellent agent-loop economics.

customer support

Speed plus frontier quality is ideal for high-volume support. Multimodal input handles screenshots.

data analysis

84.2% CharXiv chart reasoning and 1M context for large datasets at workhorse cost.

financial analysis

57.9% Finance Agent v2 is strong for an agentic suite, but high-stakes analysis still favors Pro-tier reasoning.

research assistant

1M context and fast iteration; deepest reasoning tasks still favor Gemini 3.1 Pro.

content creation

Fast, capable long-form generation for production content pipelines.

education

Low latency suits interactive tutoring; strong multimodal explanations.