A queueing model approach to measuring web server performance
- Performance = P{user will remain at the server until its service completes}
- Decreasing function of response time
- Service times ~ exp(mu), i.i.d, proportional to sizes of requested files
- Request arrivals ~ Poisson(lambda)
- Server is unaware if/when requests are aborted
- Server alternates between busy cycles, when there is at least one request in the system, and idle cycles (state 0), when there are no requests waiting or in service. These cycles are i.i.d.
- Switching time between requests is negligible