
Commit ddb15d1

fixed links
1 parent ef1fa8f commit ddb15d1

1 file changed: commercial_models/models.md
Lines changed: 98 additions & 149 deletions
<!-- markdownlint-disable MD013 -->
<!-- markdownlint-disable MD025 -->
<!-- Disabled MD025 because multiple top-level headings (#) are needed for each model section -->

# Model 1: GPT-4 (OpenAI)

## Model Name and Provider
Cloud infrastructure uses global data centers; regions are not public.
### Estimated Energy (Inference)

Published or estimated per-query energy values vary between studies.
Representative numbers include:

**Epoch AI (2024):** ≈ 0.3 Wh (0.0003 kWh) per ChatGPT/GPT-4 query.
Source: [Epoch AI – How Much Energy Does ChatGPT Use?][epoch-ai].

Other analysts estimate ≈ 0.3 – 1.8 Wh (0.0003 – 0.0018 kWh)
depending on prompt length, token output, and GPU hardware.

**Caveat:** OpenAI does not publish per-query energy data.
All estimates depend on assumptions such as:

- Hardware type (GPU vs TPU)
- Power Usage Effectiveness (PUE)
- Data center region and carbon intensity
- Prompt and token length
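Because all of these figures are assumption-driven, it helps to see how the per-query range scales. A minimal sketch, assuming a hypothetical one billion queries per day (an illustrative assumption, not an OpenAI figure):

```python
# Illustrative only: propagate the per-query range (0.3 - 1.8 Wh) to fleet scale.
# The 1e9 queries/day figure is a hypothetical assumption, not a disclosure.
WH_PER_QUERY_LOW, WH_PER_QUERY_HIGH = 0.3, 1.8
QUERIES_PER_DAY = 1_000_000_000  # assumed for illustration

def daily_energy_mwh(wh_per_query: float, queries: int) -> float:
    """Convert per-query watt-hours to total megawatt-hours per day."""
    return wh_per_query * queries / 1_000_000  # Wh -> MWh

low = daily_energy_mwh(WH_PER_QUERY_LOW, QUERIES_PER_DAY)    # ~ 300 MWh/day
high = daily_energy_mwh(WH_PER_QUERY_HIGH, QUERIES_PER_DAY)  # ~ 1800 MWh/day
print(f"~ {low:.0f} - {high:.0f} MWh/day")
```

A 6× spread in the per-query assumption becomes a 6× spread at fleet scale, which is why the studies above disagree so widely.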

### Training Energy (GPT-4)

Some analyses extrapolate GPT-4’s training energy from model size and compute budget:

≈ 51 – 62 GWh (51 772 500 – 62 318 750 kWh) for full-scale training.
Source: [The Carbon Footprint of ChatGPT][sustainability-numbers].

These are indirect estimates, not official OpenAI disclosures.
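For scale, the training estimate can be expressed in per-query terms, using the document's ≈ 0.3 Wh per-query figure. A rough sketch:

```python
# Illustrative comparison: how many ~0.3 Wh queries equal the estimated
# 51 - 62 GWh GPT-4 training budget. All inputs are the document's estimates.
TRAINING_WH_LOW = 51e9   # 51 GWh expressed in Wh
TRAINING_WH_HIGH = 62e9  # 62 GWh expressed in Wh
WH_PER_QUERY = 0.3

queries_low = TRAINING_WH_LOW / WH_PER_QUERY
queries_high = TRAINING_WH_HIGH / WH_PER_QUERY
print(f"~ {queries_low:.1e} - {queries_high:.1e} queries")  # on the order of 1e11
```

In other words, under these assumptions the one-time training cost equals on the order of a hundred billion inference queries.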

### Model Size

Estimated model size: **≈ 1.8 trillion parameters**
(widely reported estimate; OpenAI has not publicly confirmed exact parameter count).
Source: SemiAnalysis and other architecture analyses.
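One way to sanity-check the scale of such a model is its raw weight memory. The 2 bytes per parameter (fp16/bf16) used below is an assumption; OpenAI's serving precision is not public:

```python
# Back-of-envelope: weight memory for ~1.8 T parameters.
# Assumes 2 bytes/parameter (fp16/bf16); the actual precision is unknown.
PARAMS = 1.8e12
BYTES_PER_PARAM = 2  # assumed fp16/bf16

weight_bytes = PARAMS * BYTES_PER_PARAM
weight_tb = weight_bytes / 1e12   # ~ 3.6 TB of weights
gpus_80gb = weight_bytes / 80e9   # ~ 45 x 80 GB accelerators, for weights alone
print(f"~ {weight_tb:.1f} TB of weights, ~ {gpus_80gb:.0f} x 80 GB accelerators")
```

Weight storage alone implies a multi-node serving cluster, which is part of why per-query energy is hard to estimate from the outside.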

### Water Usage

Official data are unavailable, but media analyses suggest:

- A single ChatGPT query may indirectly consume ≈ 0.5 L of water.
- Generating a 100-word email may use ≈ 0.14 kWh energy and 0.52 L water.

Source: [The Verge – Sam Altman on ChatGPT Energy and Water Use][verge-gpt].
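Those two email figures also imply an effective water intensity per unit of energy, which can be cross-checked against the WUE values quoted elsewhere in this document:

```python
# Cross-check of the media figures: 0.14 kWh and 0.52 L for a 100-word email
# imply an effective water intensity (direct cooling plus indirect use) per kWh.
ENERGY_KWH = 0.14
WATER_L = 0.52

liters_per_kwh = WATER_L / ENERGY_KWH  # ~ 3.7 L/kWh
print(f"~ {liters_per_kwh:.1f} L per kWh")
```

That is far above the ≈ 0.18 L/kWh on-site WUE Google reports, suggesting the media figure bundles indirect (power-generation) water use, not just data-center cooling.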

### PUE and CI Context

Studies multiply compute energy by:

- **PUE** – Power Usage Effectiveness (total facility power / IT power)
- **CI** – Carbon Intensity (kg CO₂e / kWh electricity)

Example assumptions:

- **PUE:** ≈ 1.1 – 1.3 for Azure hyperscale centers
- **CI:** ≈ 0.3 – 0.4 kg CO₂e / kWh (depending on region)
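These two factors combine multiplicatively with per-query energy. A sketch turning the ranges above into grams of CO₂e per query:

```python
# Illustrative: combine the document's per-query energy, PUE, and CI ranges
# into grams of CO2e per query. All inputs are estimates, not disclosures.
def grams_co2e_per_query(kwh: float, pue: float, ci_kg_per_kwh: float) -> float:
    """Facility energy = IT energy x PUE; emissions = facility energy x CI."""
    return kwh * pue * ci_kg_per_kwh * 1000  # kg -> g

low = grams_co2e_per_query(0.0003, 1.1, 0.3)   # ~ 0.10 g CO2e per query
high = grams_co2e_per_query(0.0018, 1.3, 0.4)  # ~ 0.94 g CO2e per query
print(f"~ {low:.2f} - {high:.2f} g CO2e per query")
```

Even the pessimistic end stays under a gram per query; the uncertainty comes almost entirely from the per-query energy term, not from PUE or CI.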

---

# Model 2: Claude 3 Haiku (Anthropic)

## Model Name and Provider

**Claude 3 Haiku**, developed by **Anthropic**.

### Model Description

Part of Anthropic’s Claude 3 family (Haiku, Sonnet, Opus).
Released in March 2024, it is the smallest and fastest Claude 3 model, built for low-latency, energy-efficient inference in chat, summarization, and automation.

Source: [Anthropic Blog – Claude 3 Technical Overview][anthropic-blog].

### Model Size and Architecture

Estimated model size: **≈ 7 billion parameters** (Haiku variant, optimized for efficiency and low-latency inference).
Source: public model reports and community discussions.

### Hosting and Deployment

Hosted via the Anthropic API and **Amazon Bedrock (AWS)**.
These data centers maintain **PUE ≈ 1.2**.

Sources: [AWS Bedrock Claude Integration][aws-bedrock], [AWS Sustainability Report 2024][aws-report].

### Estimated Energy

Anthropic does not publish per-query energy data.
Independent analysts estimate ≈ 0.05 – 0.1 Wh (0.00005 – 0.0001 kWh) per query based on token count and GPU efficiency.

Claude 3 Haiku is ≈ 5× faster and more efficient than larger Claude 3 models.

Sources: [Epoch AI – AI Training Compute and Energy Scaling][epoch-ai-training], [Anthropic Claude 3 Announcement][anthropic-announcement].
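Taken together with the GPT-4 estimate earlier in this document (≈ 0.3 Wh per query), these figures imply a rough cross-model efficiency ratio:

```python
# Rough cross-model comparison using the per-query estimates quoted in this
# document; both inputs are third-party estimates, not vendor figures.
GPT4_WH = 0.3
HAIKU_WH_LOW, HAIKU_WH_HIGH = 0.05, 0.1

ratio_high = GPT4_WH / HAIKU_WH_LOW   # ~ 6x if Haiku sits at the low end
ratio_low = GPT4_WH / HAIKU_WH_HIGH   # ~ 3x if Haiku sits at the high end
print(f"GPT-4 uses ~ {ratio_low:.0f} - {ratio_high:.0f}x more energy per query")
```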

### Training Energy

Claude 3 models are trained on GPU clusters (NVIDIA A100/H100) primarily hosted on AWS infrastructure.
For models in the 10 – 30 B parameter range, training energy is typically **3 000 – 10 000 MWh**.

Sources: [Epoch AI – AI Training Compute and Energy Scaling][epoch-ai-training], [Anthropic Responsible Scaling Policy][anthropic-policy].
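As a sanity check, the low end of that range can be converted to GPU-hours. The ≈ 0.7 kW average H100 board power and PUE ≈ 1.2 used below are assumptions for illustration, not Anthropic figures:

```python
# Sanity check under stated assumptions: average GPU draw and PUE are assumed,
# not published by Anthropic.
TRAINING_KWH = 3_000_000  # low end of the range: 3 000 MWh
GPU_KW = 0.7              # assumed average H100 power draw
PUE = 1.2                 # assumed facility overhead

gpu_hours = TRAINING_KWH / (GPU_KW * PUE)  # ~ 3.6 million GPU-hours
print(f"~ {gpu_hours:.2e} GPU-hours at the 3 000 MWh low end")
```

A few million GPU-hours is consistent with a multi-week run on a cluster of a few thousand GPUs, so the MWh range is at least internally plausible.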

### Water Usage

Anthropic has not published specific water consumption figures for the Claude 3 family.
Because it relies on AWS data centers, cooling water use is managed under the AWS sustainability strategy.
AWS data centers in cooler regions use air cooling to reduce the water footprint, while others recycle water on-site.

Sources: [AWS Water Stewardship Report][aws-water], [Anthropic Sustainability Commitments][anthropic-sustainability].

### PUE and CI Context

AWS’s average **PUE ≈ 1.2** (accounts for cooling and power-delivery losses).
Carbon intensity (CI): ≈ 0 – 0.2 kg CO₂e / kWh, depending on the regional renewable mix.
AWS aims for 100 % renewable energy by 2025.

Sources: [AWS Global Infrastructure Efficiency Data][aws-efficiency], [Anthropic Responsible Scaling Policy][anthropic-policy].

---

# Model 3: Gemini Nano (Google)

## Model Name and Provider

**Gemini Nano**, developed by **Google DeepMind**.
Smallest member of the Gemini family (Nano, Pro, Ultra).

### Hosting and Deployment

Runs on-device via **Android AICore** (a subsystem introduced in 2023).
Designed for mobile hardware such as the Pixel 8 Pro and Pixel 9.
Reduces energy use by eliminating cloud compute and network load.

Sources: [Google AI Blog – Introducing Gemini][google-blog], [Android Developers – Gemini Nano Overview][android-dev], [The Verge – Gemini Nano on Pixel 8 Pro][verge-gemini].

### Model Size and Architecture

Gemini Nano variants (device-optimized):

- **Nano-1:** ≈ 1.8 billion parameters
- **Nano-2:** ≈ 3.25 billion parameters

These use quantized weights tuned for on-device inference.
Source: device benchmark reports and public parameter listings.
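With quantized weights, the on-device memory footprint can be roughly bounded. The 4-bit weight assumption below is illustrative; Google has not published the exact quantization scheme:

```python
# Rough on-device weight memory for the Nano variants, assuming 4-bit
# quantized weights (an assumption; the real scheme is not public).
BITS_PER_WEIGHT = 4  # assumed quantization width

def weight_gb(params: float, bits: int = BITS_PER_WEIGHT) -> float:
    """Weight memory in GB: params x bits, converted to bytes then GB."""
    return params * bits / 8 / 1e9

nano1 = weight_gb(1.8e9)   # ~ 0.9 GB for Nano-1
nano2 = weight_gb(3.25e9)  # ~ 1.6 GB for Nano-2
print(f"Nano-1 ~ {nano1:.1f} GB, Nano-2 ~ {nano2:.1f} GB")
```

Sub-2 GB weight footprints are what make these models viable inside a phone's memory budget alongside the OS and apps.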

### Estimated Energy

No official values have been published.
Device benchmarks show ≈ 0.01 Wh (0.00001 kWh) per query — 10 – 30× more efficient than GPT-4.

Sources: [Google Pixel AI Benchmarks (2024)][google-pixel-ai], [Epoch AI – How Much Energy Does ChatGPT Use][epoch-ai].

### Training Energy

Gemini Nano was distilled from larger Gemini models trained on **TPU v5e** clusters.
Training energy for Nano ≈ 200 – 1 200 MWh (≈ 1 – 5 % of Gemini Ultra’s training compute).

Sources: [Google Research – Efficient TPU Training (2024)][google-tpu-paper], [Google Cloud Sustainability Report (2024)][google-cloud-sustainability].

### Water Usage

Inference uses no data-center water since it runs locally on devices.
Training used Google data centers with a Water Usage Effectiveness (WUE) of ≈ 0.18 L/kWh.
Google targets a net-positive water impact by 2030.

Sources: [Google Environmental Report (2024)][google-env-report], [Bloomberg – Google AI’s Thirst for Water][bloomberg-water].
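Combining the training-energy range above with the quoted WUE gives an implied training water footprint:

```python
# Implied training water footprint: the document's 200 - 1 200 MWh range
# multiplied by Google's reported WUE of ~0.18 L/kWh.
WUE_L_PER_KWH = 0.18

def water_liters(training_mwh: float) -> float:
    """Cooling water in liters: MWh -> kWh, then times liters per kWh."""
    return training_mwh * 1000 * WUE_L_PER_KWH

low = water_liters(200)    # ~ 36 000 L
high = water_liters(1200)  # ~ 216 000 L
print(f"~ {low:,.0f} - {high:,.0f} L of cooling water for training")
```

Tens to hundreds of cubic meters for a one-time training run, versus zero data-center water per on-device inference.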

### PUE and CI Context

Google data centers report an average **PUE ≈ 1.10 – 1.12**.
Carbon Intensity (CI) ≈ 0.15 kg CO₂e / kWh due to a 70 %+ renewable energy mix.
On-device execution draws < 5 W of mobile power during inference.

Sources: [Google Data Center Efficiency Overview (2024)][google-efficiency], [Google TPU v5e Efficiency Blog (2024)][google-tpu-blog].

---

[azure-blog]: https://azure.microsoft.com/en-us/blog/introducing-gpt4-in-azure-openai-service/
[epoch-ai]: https://epoch.ai/gradient-updates/how-much-energy-does-chatgpt-use
[sustainability-numbers]: https://www.sustainabilitybynumbers.com/p/carbon-footprint-chatgpt
[verge-gpt]: https://www.theverge.com/2023/4/19/openai-ceo-sam-altman-chatgpt-energy-water-use
[anthropic-blog]: https://www.anthropic.com/news/claude-3-family
[aws-bedrock]: https://aws.amazon.com/bedrock/
[aws-report]: https://aws.amazon.com/about-aws/sustainability/
[anthropic-announcement]: https://www.anthropic.com/news/claude-3-models
[epoch-ai-training]: https://epoch.ai/gradient-updates/ai-training-compute-energy-scaling
[anthropic-policy]: https://www.anthropic.com/news/responsible-scaling-policy
[aws-water]: https://aws.amazon.com/about-aws/sustainability/#water
[anthropic-sustainability]: https://www.anthropic.com/sustainability
[aws-efficiency]: https://aws.amazon.com/about-aws/sustainability/
[google-blog]: https://blog.google/technology/ai/google-gemini-ai/
[android-dev]: https://developer.android.com/ai/gemini-nano
[verge-gemini]: https://www.theverge.com/2023/12/6/23990823/google-gemini-ai-models-nano-pro-ultra
[google-pixel-ai]: https://ai.google/discover/pixel-ai/
[google-tpu-paper]: https://arxiv.org/abs/2408.15734
[google-cloud-sustainability]: https://sustainability.google/reports/environmental-report-2024/
[google-env-report]: https://sustainability.google/reports/environmental-report-2024/
[bloomberg-water]: https://www.bloomberg.com/news/articles/2023-08-09/google-ai-s-thirst-for-water-could-leave-towns-dry
[google-efficiency]: https://cloud.google.com/sustainability/data-centers
[google-tpu-blog]: https://cloud.google.com/blog/products/ai-machine-learning/introducing-tpu-v5e
