A queueing model approach to measuring web server performance

Performance = P{user remains at the server until service of the request completes}
Performance is a decreasing function of response time
Service times ~ exp(mu), i.i.d., proportional to the sizes of the requested files
Request arrivals ~ Poisson(lambda)
Server is unaware if/when requests are aborted
Server alternates between busy cycles, when there is at least one request in the system, and idle cycles (state 0), when there are no requests waiting or in service. These cycles are i.i.d.
Switching time between requests is negligible
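
The assumptions above can be illustrated with a small simulation. This is a minimal sketch, not the model's actual analysis. It assumes FIFO service (the scheduling discipline is not stated above) and a hypothetical user patience ~ exp(theta), independent of everything else; the names simulate_mm1, closed_form_performance and theta are illustrative only. Because the server is unaware of aborts, an impatient user does not change the queue dynamics; the abort only affects whether that request counts as a success.

```python
import random

def simulate_mm1(lam, mu, theta, n_requests, seed=0):
    """Simulate a FIFO M/M/1 queue (FIFO is an assumption of this sketch).

    lam   : Poisson arrival rate
    mu    : exponential service rate
    theta : rate of the hypothetical exponential user patience
    Returns (performance, idle_fraction, mean_response_time).
    """
    rng = random.Random(seed)
    t_arrival = 0.0        # arrival time of the current request
    server_free_at = 0.0   # time at which the server finishes all earlier work
    idle_time = 0.0        # total time spent in state 0 (empty system)
    stayed = 0             # requests whose user waited until completion
    total_response = 0.0

    for _ in range(n_requests):
        t_arrival += rng.expovariate(lam)
        if t_arrival > server_free_at:
            # The system was empty: an idle cycle ends, a busy cycle begins.
            idle_time += t_arrival - server_free_at
            start = t_arrival
        else:
            start = server_free_at
        # Switching time is negligible, so service starts immediately.
        server_free_at = start + rng.expovariate(mu)
        response = server_free_at - t_arrival
        total_response += response
        # The server is unaware of aborts: the request is served regardless,
        # and we only record whether the user was still there at completion.
        if rng.expovariate(theta) > response:
            stayed += 1

    horizon = server_free_at
    return stayed / n_requests, idle_time / horizon, total_response / n_requests

def closed_form_performance(lam, mu, theta):
    """P{patience > response time} for FIFO M/M/1 with exp(theta) patience.

    Uses the standard result that the FIFO M/M/1 response time is
    exponential with rate (mu - lam); requires lam < mu.
    """
    if lam >= mu:
        raise ValueError("unstable queue: need lam < mu")
    return (mu - lam) / (mu - lam + theta)

if __name__ == "__main__":
    perf, idle, mean_t = simulate_mm1(lam=0.5, mu=1.0, theta=0.2, n_requests=200_000)
    print("simulated performance:", round(perf, 3))
    print("closed-form performance:", round(closed_form_performance(0.5, 1.0, 0.2), 3))
    print("idle fraction:", round(idle, 3), "mean response time:", round(mean_t, 3))
```

Under these assumptions the simulated performance should be close to (mu - lambda) / (mu - lambda + theta), and the idle fraction close to 1 - lambda/mu. Both performance values fall as lambda approaches mu, which matches performance being a decreasing function of response time.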