Lead/Sr SRE
CyberCoders

Overland Park, Kansas

Posted in Retail


This job has expired.

Job Info


Lead/Sr SRE

If you are a Lead/Sr SRE with experience, please read on!

Our client is scaling its engineering organization with the introduction of a Site Reliability and Release Management practice. This team will focus on building, deploying, and operating highly reliable, performant, and secure applications on AWS.
What You Will Be Doing
- Comply with all company and departmental policies and procedures
- Maintain responsibility for the design, deployment, and maintenance of production-scale systems.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
- Use automation to streamline the provisioning, management, and monitoring of applications and services using multiple scripting languages and infrastructure-as-code.
- Facilitate blameless Incident Retrospectives to understand root causes, communicate learnings, determine remediation and make us better and closer as a team.
- Coordinate with development and platform teams to design and implement zero-downtime deployment approaches, real-time logging, alerting, and monitoring solutions, and code instrumentation.
- Coordinate with the architects to design a highly available solution that meets availability and reliability objectives and to reduce manual activities using automation, when feasible.
- Identify, evaluate, and recommend monitoring tools and diagnostic techniques relevant to the application architecture. Assess gaps in existing monitoring tool capabilities and recommend tools to augment or replace them.
- Instrument applications to enable performance diagnostics and monitoring
- Collaborate with developers to promote reliability engineering during all phases of the SDLC so that performance issues are detected and corrected earlier in the lifecycle
- Participate in re-architecture, redesign, and refactoring decisions to satisfy performance requirements
- Develop dashboards and reports to provide ongoing visibility into the performance of applications
- Other duties as required/assigned
What You Need for this Position
- Bachelor's degree or equivalent work experience (minimum 12 years). (If Associate's degree, must have minimum 6 years of work experience.)
- AWS Certifications (Solution Architect, Developer, DevOps Engineer)
- Experience with Agile and DevOps
- Understanding or exposure to Chaos Engineering Tools (Chaos Toolkit, Gremlin, Simian Army, etc.)
- Experience with infrastructure-as-code automation tools, including CloudFormation and Terraform
- Experience with version control software, including Git
- Experience in calculating system reliability metrics, including RPO, RTO, SLO & SLI
- Experience with containers (Kubernetes and Docker)
- Experience with logging solutions, including Datadog and Sumo Logic
- Knowledge of defining and monitoring system quality measures, including SLO and SLA
- Experience with distributed computing, Web Services, SOA, and JEE design concepts
- Experience delivering software designed for high concurrency, scalability, or availability
- Hands-on experience collecting performance data, analyzing, troubleshooting, and tuning
- Experience with different flavors of Linux, e.g. Red Hat, Ubuntu, CentOS, etc.
- Experience building tooling to improve system reliability, automate remediation of issues, or improve scalability.
- Systems often need to be reconfigured, so you should have experience with a configuration management system like Puppet, Chef or Salt.
- Experience with usage of common application protocols and messages (e.g. TCP/IP, HTTP, SOAP, RESTful APIs, XML/JSON, JDBC, JMS/MQ)
- Exposure to Cloud, SaaS, and virtualization concepts and performance concerns
- Exposure to application threading and concurrency concerns
- Working knowledge of operating system design, processes, and threading model
- Ability to work in other languages such as JavaScript, Ruby, PHP, Perl, Python, PowerShell, and Linux shell scripting
- Experience with Amazon Web Services
What's In It for You
- Competitive comp package ($170-200k/yr - may go higher for the right candidate)
- 401k w/match
- Full health insurance benefits (medical, dental & vision)
- Family oriented culture
- Autonomy
- Team collaboration
- 100% REMOTE
So, if you are a Lead/Sr SRE with experience, please apply today!
Colorado employees will receive paid sick leave. For additional information about available benefits, please contact Nick Valenti.
- Applicants must be authorized to work in the U.S.



*CyberCoders, Inc is proud to be an Equal Opportunity Employer*



All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status, or any other characteristic protected by law.



*Your Right to Work* - In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.

