Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
33 changes: 14 additions & 19 deletions packages/app/src/components/page-content.tsx
Original file line number Diff line number Diff line change
Expand Up @@ -256,31 +256,26 @@ export function PageContent({ initialTab = 'inference' }: { initialTab?: string
<section>
<Card data-testid="intro-section">
<h2 className="text-lg font-semibold mb-2">
LLM inference performance is a major concern in providing AI services, but
accurate performance analysis remains elusive.
Open Source Continuous Inference Standard and Research Platform trusted by
Operators of Trillion Dollar Token Factories
</h2>
<p className="text-muted-foreground mb-2">
The fast cadence of software development and model releases makes comparing
performance between setups difficult. Existing performance benchmarks quickly
become obsolete because they are static, and participants often game the
benchmarks with unrealistic, highly specific configurations.
As the world progresses exponentially towards AGI, software development and model
releases move at the speed of light. Existing benchmarks rapidly become obsolete
due to their static nature, and participants often submit software images purpose
built for the benchmark itself which do not reflect real world performance
</p>
<p className="text-muted-foreground mb-2">
<strong>InferenceX</strong> (formerly InferenceMAX) addresses these issues by
benchmarking popular models on major hardware platforms, updating benchmarks as
new software versions are released.
</p>
<p className="text-muted-foreground mb-2">
For each model and hardware combination, <strong>InferenceX</strong> sweeps
through different tensor parallel sizes and maximum concurrent requests,
presenting a throughput vs. latency graph for a complete picture. In terms of
software configurations, we ensure they are broadly applicable across different
serving scenarios, and we open-source the repo to encourage community
contributions.
<strong>InferenceX</strong> (formerly InferenceMAX) is our independent,
reproducible benchmark which addresses these issues by continously benchmarking
inference software across an wide range of AI accelerators that is acutally
available to the the ML community. We continously update the benchmarks to capture
the speed of progress.
</p>
<p className="text-muted-foreground">
We hope <strong>InferenceX</strong> provides the community with up-to-date and
realistic LLM inference performance insights. Learn more in our articles:{' '}
Our open data & insights is widely adopted by the ML community, capacity planning
strategy teams at trillion dollar token factories & AI Labs & at multiple billion
dollar NeoClouds. Learn more in our articles:{' '}
<a
href="https://newsletter.semianalysis.com/p/inferencemax-open-source-inference"
target="_blank"
Expand Down