Skip to content

Latest commit

Β 

History

History
197 lines (128 loc) Β· 9.14 KB

File metadata and controls

197 lines (128 loc) Β· 9.14 KB

Comparative Environmental and Technical Overview of Modern AI Models

This document presents verified technical and environmental data for three leading AI models β€” OpenAI GPT-4, Anthropic Claude 3 Haiku, and Google Gemini Nano β€” focusing on energy, water, and sustainability context.


Model 1: GPT-4 (OpenAI)

Model Name & Provider

GPT-4, developed by OpenAI.

Hosting / Deployment

Hosted via Microsoft Azure OpenAI Service, which operates on Azure’s global data centers (specific regions not publicly disclosed).
Source: Introducing GPT-4 in Azure OpenAI Service – Microsoft Azure Blog

Estimated Model Size / Architecture

GPT-4 is widely considered a frontier model employing a Sparse Mixture-of-Experts (MoE) architecture.
This structure activates only a subset of parameters per inference, optimizing efficiency while maintaining scale.
Estimated total parameters exceed 1 trillion.

Sources:

Estimated Energy (Inference)

Estimates vary between studies:

Note: OpenAI has not released official inference energy data.
Values depend on hardware (GPU vs TPU), data-center PUE, and carbon intensity.

Training Energy Estimates

Extrapolated from compute budgets and model size:
β‰ˆ 51,772,500 – 62,318,750 kWh (β‰ˆ 51.8 – 62.3 GWh) for full-scale training.
Source: The Carbon Footprint of ChatGPT – Sustainability by Numbers

Water Usage

Water consumption derives from data-center cooling processes:

PUE / CI Context Used in Studies

Environmental analyses generally apply:

  • PUE (Power Usage Effectiveness): β‰ˆ 1.1 – 1.3 (Azure hyperscale data centers)
  • CI (Carbon Intensity): β‰ˆ 0.3 – 0.4 kg COβ‚‚e / kWh (depending on regional grid mix)

Model 2: Claude 3 Haiku (Anthropic)

Model Description

Claude 3 Haiku is the smallest and fastest member of Anthropic’s Claude 3 model family ( Haiku, Sonnet, Opus ), released March 2024.
It is optimized for low-latency, energy-efficient applications such as chatbots, summarization, and enterprise automation.

Sources:

Hosting / Deployment

Claude 3 Haiku is available through Anthropic’s API and AWS Bedrock.
AWS data centers maintain an average PUE of β‰ˆ 1.2.
Sources:

Estimated Model Size / Architecture

Community estimates place Claude 3 Haiku at β‰ˆ 20 B parameters.
The largest model in the family (Claude 3 Opus) is β‰ˆ 2 T parameters.
Source: ClaudeAI Community Discussion (Reddit)

Estimated Energy (Inference)

Anthropic does not publish per-query energy figures.
Based on 10–30 B parameter transformers: β‰ˆ 0.05 – 0.1 Wh (0.00005 – 0.0001 kWh) per query.
Haiku is β‰ˆ 5Γ— more efficient than Claude 3 Sonnet or Opus.
Sources:

Training Energy Estimates

Claude 3 models are trained on GPU clusters (NVIDIA A100/H100) via AWS.
Typical training energy for models of this scale: β‰ˆ 3,000 – 10,000 MWh.
Sources:

Water Usage

Anthropic does not publish direct figures; relies on AWS cooling efficiency and water recycling policies.
Sources:

PUE / CI Context Used in Studies


Model 3: Gemini Nano (Google DeepMind)

Model Name & Provider

Gemini Nano, developed by Google DeepMind, is the smallest member of the Gemini family (Nano, Pro, Ultra).
Sources:

Hosting / Deployment

Runs on-device through Android’s AICore system (launched in Android 14).
Deployed on optimized hardware (e.g., Pixel 8 Pro, Pixel 9 Series).
This local processing approach eliminates cloud compute energy and network latency.
Additional coverage: - Android Developers – Gemini Nano Overview

Estimated Model Size / Architecture

Deployed in quantized versions:

Estimated Energy (Inference)

Training Energy Estimates

Gemini Nano was distilled from larger Gemini models trained on Google TPU v5e clusters.
Training energy estimated β‰ˆ 200 – 1,200 MWh (total, amortized across billions of devices).
Sources:

Water Usage

PUE / CI Context Used in Studies


Summary

Model Developer Hosting Type Est. Parameters Inference Energy (Wh/query) Training Energy (MWh) PUE CI (kg COβ‚‚e/kWh)
GPT-4 OpenAI Cloud (Azure) β‰ˆ 1 T + (MoE) 0.3 – 1.8 β‰ˆ 51 K – 62 K MWh 1.1–1.3 0.3–0.4
Claude 3 Haiku Anthropic Cloud (AWS Bedrock) β‰ˆ 20 B 0.05 – 0.1 3 K – 10 K β‰ˆ 1.2 0–0.2
Gemini Nano Google DeepMind On-Device (Android AICore) 1.8–3.25 B β‰ˆ 0.01 (on-device) 200–1,200 1.10–1.12 β‰ˆ 0.03 g COβ‚‚e /query