2016-11-07

Thanks to our ongoing expansion, we have the opportunity to grow our Cloud Site Reliability team. We are the part of the Elastic Cloud team with an operations background, and we aren’t afraid to get our hands dirty. We are the first line of consumers for Elastic's products, and our experience helps influence the direction of the product. While most organizations may have a single Elastic Stack deployment or a handful of them, here you’ll be responsible for identifying, troubleshooting and reporting platform problems to developers to ensure that the thousands of Elasticsearch clusters we manage provide a stable and reliable service. We’re looking for people who are just as excited about troubleshooting issues with distributed systems as they are about automating, coding and collaborating to solve problems.

Responsibilities

Report and troubleshoot problems within the Elastic Cloud infrastructure services and collaborate on issues with developers

Handle day-to-day operations around Elastic Cloud, such as customer trouble tickets, managing cloud provider infrastructure (maintenance/expansion), and software deployments

Develop and enhance tooling to deploy and manage the Elastic Cloud product and infrastructure

Demonstrate and promote best practices for teams using cloud platforms
