🚀 About Groupon:
Groupon connects millions of customers with deals from over a million local businesses worldwide. We're a dynamic company, big enough to offer significant resources and scale, yet small enough to allow for individual autonomy and impactful contributions. We're passionate about helping local businesses thrive and fostering a culture of innovation and success.
🖥️ The Role:
As a Senior Production Service Support Engineer, you'll be a key player in ensuring the smooth operation of Groupon's internal systems. You'll leverage Site Reliability Engineering best practices and the ITIL Solutions Architecture framework to develop and implement incident management strategies.
Responsibilities:
- Act as Incident Commander, change manager, and senior technical resource for site- and service-impacting incidents across Groupon's 300+ globally dispersed services.
- Prevent, identify, triage, document, investigate, mitigate, and recover from incidents.
- Coordinate resolution of Post Mortems and oversee Problem Management.
- Devote dedicated project time to engaging work.
- Work as part of the Incident Management team (Monday-Friday shifts, with one weekend as primary on-call every 10 weeks).
💪 What We're Looking For:
Essential Skills & Experience:
- 6+ years administering Linux system environments and performing root cause analysis of site-impacting issues.
- 4+ years creating Splunk or Kibana search queries to identify, resolve, and prevent incidents and outages.
- Experience owning impacting events through to resolution, including coordination with Subject Matter Experts, task triage, documentation, action items, and Post Mortems.
- 6+ years of experience with web application operations and root cause analysis.
- 6+ years developing policies and procedures to improve production stability.
- Excellent communication, consulting, and collaboration skills for interfacing with senior leadership.
Preferred Skills & Experience:
- Experience with Python, Ruby, or Java.
- BS, MS, or PhD in Computer Science or a related field.
- Experience designing and creating tools for site and service management.
🌟 We Value Engineers Who Are:
- Customer-focused
- Team players
- Fast learners
- Pragmatic
- Proactive
⚙️ Our Infrastructure Ecosystem:
- AWS/GCP Environment
- Docker and Kubernetes
- Elasticsearch
- Pingdom, Opsgenie, Kibana, and Wavefront monitoring tools
- GitHub and Jira
- Java, Ruby, Node.js, and Next.js
- MySQL and PostgreSQL databases
- Redis and Memcached
- Akamai CDN
- Python Tooling
🌎 Location: Remote (US time zone; working hours 6 PM - 2 AM CET)
Important Note: Groupon's recruitment process is merit-based and does not involve any fees. Beware of recruitment fraud; always verify job postings through Groupon's official career website: grouponcareers.com