* lower premium concurrency in preparation for key+IP limits
* include the ip in the user semaphore
* 3, not 5; this is our current limit for free
* per user_id+ip rate limiting