metrics

Requests per second (RPS)

RPS measures how many requests a system or test sends each second; it is useful only when paired with latency, errors, and request mix.

Also known as: RPS, throughput

Definition

Requests per second is a throughput metric: how many requests are sent or completed in one second. It is often the headline number for API and web load tests.

In Maxoperf

Maxoperf surfaces throughput alongside latency percentiles, error facets, labels, logs, and runner health. That context matters because one RPS number can hide a failing endpoint or a saturated runner.

Common pitfalls

Comparing RPS across tests with different request mixes.
Optimizing for throughput while p95 or p99 latency is already outside the release target.

FAQ

Is higher RPS always better?

No. High RPS with unacceptable latency or errors is a failed result. Throughput must be read beside user experience and correctness.

Can VUs and RPS both be used in a load profile?

Yes. VUs describe concurrency while RPS describes request rate; many teams use both to make the intended load shape explicit.